Commits · Xenobd/whisper.cpp

docker : add libsdl2-dev for container builds (#2424)

aa93432
unverified

JohnnyB commited on Sep 20, 2024

go : add tests and update bindings (#2425)

c80d17a
unverified

Stavros Panakakis commited on Sep 20, 2024

server : use OS-generated temp file name for converted files (#2419)

04d9c8d
unverified

teejae commited on Sep 17, 2024

go : fix CUDA build (#2416)

dafe96d
unverified

Binozo Binozo commited on Sep 15, 2024

cann : add Ascend NPU instructions (#2410)

ae9acd3
unverified

Mimi89757 commited on Sep 11, 2024

cmake: Fix libdir value in pkgconfig file (#2407)

a048ef3
unverified

Philippe Normand commited on Sep 7, 2024

revert : cmake : set MSVC to use UTF-8 on source files (#2346)

5e9ff52

ggerganov commited on Sep 2, 2024

sync : ggml

b13db51

ggerganov commited on Sep 2, 2024

ggml: fix ggml_graph_cpy undefined behavior (ggml/943)

9202e70

JohannesGaessler commited on Aug 31, 2024

cann : fix doxy (ggml/0)

406ac07

ggerganov commited on Aug 28, 2024

vulkan : fix build (llama/0)

e237370

ggerganov commited on Aug 27, 2024

cuda : mark BF16 CONT as unsupported

561bebd

ggerganov commited on Aug 28, 2024

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)

33c59fc

smeso

ggerganov commited on Aug 28, 2024

cmake : set MSVC to use UTF-8 on source files (#2346)

9b3df8e
unverified

Tim Miller commited on Aug 30, 2024

readme : remove invalid flag from Python example (#2396)

5372e8b
unverified

UsernamesLame commited on Aug 30, 2024

readme : fix link (#2394)

ae51c50
unverified

ggerganov commited on Aug 30, 2024

go : add beamsize/entropythold/maxcontext to context interface (#2350)

7efcda7
unverified

hsinhoyeh commited on Aug 28, 2024

talk-llama : sync llama.cpp

4493ffd

ggerganov commited on Aug 28, 2024

whisper : update FA call

2bfec97

ggerganov commited on Aug 28, 2024

sync : ggml

7ba8c97

ggerganov commited on Aug 28, 2024

sync : vulkan (skip) (llama/0)

5fe3dd6

ggerganov commited on Aug 27, 2024

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)

d64f932

slaren commited on Aug 26, 2024

metal : separate scale and mask from QKT in FA kernel (llama/9189)

90cc3cd

ggerganov commited on Aug 26, 2024

ggml : add SSM Metal kernels (llama/8546)

b6e7294

ggerganov commited on Aug 26, 2024

metal : gemma2 flash attention support (llama/9159)

e62fd15

slaren commited on Aug 26, 2024

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)

fb8ae8b

JohannesGaessler commited on Aug 24, 2024

Add a space to supress a cmake warning (llama/9133)

287612e

qnixsynapse commited on Aug 22, 2024

Add oneDNN primitive support (llama/9091)

b4d8c3e

KevinLy commited on Aug 22, 2024

llama : simplify Mamba with advanced batch splits (llama/8526)

f1abcb4

compilade

ggerganov commited on Aug 21, 2024

fallback mmvq (llama/9088)

4b1fda0

hengyu Alberto Cabrera Pérez commited on Aug 20, 2024

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)

5f43886

zhentaoyu commited on Aug 20, 2024

rpc : print error message when failed to connect endpoint (llama/9042)

d54b156

rgerganov commited on Aug 19, 2024

rpc : prevent crashes on invalid input (llama/9040)

656ae00

rgerganov commited on Aug 19, 2024

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047)

e0dc1ad

nicoboss commited on Aug 16, 2024

cmake : remove unused option GGML_CURL (llama/9011)

12634fc

ggerganov commited on Aug 14, 2024

ggml : move rope type enum to ggml.h (llama/8949)

9d45f48

danbev slaren commited on Aug 13, 2024

ggml: fix div-by-zero (llama/9003)

d9ee26f

DavidKorczynski commited on Aug 12, 2024

Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (llama/8943)

11bc9e6

Markus Tavenrath

OccamRazor commited on Aug 11, 2024

feat: ref. cross entropy, add CUDA, fix grad test (ggml/929)

e1e87a3

JohannesGaessler commited on Aug 27, 2024

ggml: remove bad assert (ggml/928)

ba483f7

JohannesGaessler commited on Aug 24, 2024

examples: add MNIST training + missing ops

0828065

JohannesGaessler commited on Jul 30, 2024

models : add support for wget2 for fedora (#2387)

0653499
unverified

Brad Murray commited on Aug 28, 2024

readme : update the path to bench.py (#2386)

57c7a6b
unverified

Peng commited on Aug 28, 2024

readme : fix typo (#2383)

16e5a16
unverified

ivoputzer commited on Aug 28, 2024

readme : fix broken links in implementation details section (#2382)

4863dee
unverified

stormofice commited on Aug 28, 2024

whisper : fix compile warning for unused params

0e05e03
unverified

ggerganov commited on Aug 28, 2024

sync : ggml vulkan (ggml/0)

c4c7e49

ggerganov commited on Aug 20, 2024

yolo : add backend support (ggml/924)

630d713

rgerganov

ggerganov commited on Aug 19, 2024

ggml : fix typo in ggml-quants.c comment (ggml/922)

f158bc0

danbev commited on Aug 15, 2024

feat: add new `sin` and `cos` operators (ggml/919)

f541d31

Ronsor

ggerganov commited on Aug 12, 2024

Commit History

docker : add libsdl2-dev for container builds (#2424) aa93432 unverified

go : add tests and update bindings (#2425) c80d17a unverified

server : use OS-generated temp file name for converted files (#2419) 04d9c8d unverified

go : fix CUDA build (#2416) dafe96d unverified

cann : add Ascend NPU instructions (#2410) ae9acd3 unverified

cmake: Fix libdir value in pkgconfig file (#2407) a048ef3 unverified

revert : cmake : set MSVC to use UTF-8 on source files (#2346) 5e9ff52

sync : ggml b13db51

ggml: fix ggml_graph_cpy undefined behavior (ggml/943) 9202e70

cann : fix doxy (ggml/0) 406ac07

vulkan : fix build (llama/0) e237370

cuda : mark BF16 CONT as unsupported 561bebd

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934) 33c59fc

cmake : set MSVC to use UTF-8 on source files (#2346) 9b3df8e unverified

readme : remove invalid flag from Python example (#2396) 5372e8b unverified

readme : fix link (#2394) ae51c50 unverified

go : add beamsize/entropythold/maxcontext to context interface (#2350) 7efcda7 unverified

talk-llama : sync llama.cpp 4493ffd

whisper : update FA call 2bfec97

sync : ggml 7ba8c97

sync : vulkan (skip) (llama/0) 5fe3dd6

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192) d64f932

metal : separate scale and mask from QKT in FA kernel (llama/9189) 90cc3cd

ggml : add SSM Metal kernels (llama/8546) b6e7294

metal : gemma2 flash attention support (llama/9159) e62fd15

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542) fb8ae8b

Add a space to supress a cmake warning (llama/9133) 287612e

Add oneDNN primitive support (llama/9091) b4d8c3e

llama : simplify Mamba with advanced batch splits (llama/8526) f1abcb4

fallback mmvq (llama/9088) 4b1fda0

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052) 5f43886

rpc : print error message when failed to connect endpoint (llama/9042) d54b156

rpc : prevent crashes on invalid input (llama/9040) 656ae00

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047) e0dc1ad

cmake : remove unused option GGML_CURL (llama/9011) 12634fc

ggml : move rope type enum to ggml.h (llama/8949) 9d45f48

ggml: fix div-by-zero (llama/9003) d9ee26f

Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (llama/8943) 11bc9e6

feat: ref. cross entropy, add CUDA, fix grad test (ggml/929) e1e87a3

ggml: remove bad assert (ggml/928) ba483f7

examples: add MNIST training + missing ops 0828065

models : add support for wget2 for fedora (#2387) 0653499 unverified

readme : update the path to bench.py (#2386) 57c7a6b unverified

readme : fix typo (#2383) 16e5a16 unverified

readme : fix broken links in implementation details section (#2382) 4863dee unverified

whisper : fix compile warning for unused params 0e05e03 unverified

sync : ggml vulkan (ggml/0) c4c7e49

yolo : add backend support (ggml/924) 630d713

ggml : fix typo in ggml-quants.c comment (ggml/922) f158bc0

feat: add new `sin` and `cos` operators (ggml/919) f541d31

docker : add libsdl2-dev for container builds (#2424)

aa93432
unverified

go : add tests and update bindings (#2425)

c80d17a
unverified

server : use OS-generated temp file name for converted files (#2419)

04d9c8d
unverified

go : fix CUDA build (#2416)

dafe96d
unverified

cann : add Ascend NPU instructions (#2410)

ae9acd3
unverified

cmake: Fix libdir value in pkgconfig file (#2407)

a048ef3
unverified

revert : cmake : set MSVC to use UTF-8 on source files (#2346)

5e9ff52

sync : ggml

b13db51

ggml: fix ggml_graph_cpy undefined behavior (ggml/943)

9202e70

cann : fix doxy (ggml/0)

406ac07

vulkan : fix build (llama/0)

e237370

cuda : mark BF16 CONT as unsupported

561bebd

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)

33c59fc

cmake : set MSVC to use UTF-8 on source files (#2346)

9b3df8e
unverified

readme : remove invalid flag from Python example (#2396)

5372e8b
unverified

readme : fix link (#2394)

ae51c50
unverified

go : add beamsize/entropythold/maxcontext to context interface (#2350)

7efcda7
unverified

talk-llama : sync llama.cpp

4493ffd

whisper : update FA call

2bfec97

sync : ggml

7ba8c97

sync : vulkan (skip) (llama/0)

5fe3dd6

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)

d64f932

metal : separate scale and mask from QKT in FA kernel (llama/9189)

90cc3cd

ggml : add SSM Metal kernels (llama/8546)

b6e7294

metal : gemma2 flash attention support (llama/9159)

e62fd15

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)

fb8ae8b

Add a space to supress a cmake warning (llama/9133)

287612e

Add oneDNN primitive support (llama/9091)

b4d8c3e

llama : simplify Mamba with advanced batch splits (llama/8526)

f1abcb4

fallback mmvq (llama/9088)

4b1fda0

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)

5f43886

rpc : print error message when failed to connect endpoint (llama/9042)

d54b156

rpc : prevent crashes on invalid input (llama/9040)

656ae00

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047)

e0dc1ad

cmake : remove unused option GGML_CURL (llama/9011)

12634fc

ggml : move rope type enum to ggml.h (llama/8949)

9d45f48

ggml: fix div-by-zero (llama/9003)

d9ee26f

Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (llama/8943)

11bc9e6

feat: ref. cross entropy, add CUDA, fix grad test (ggml/929)

e1e87a3

ggml: remove bad assert (ggml/928)

ba483f7

examples: add MNIST training + missing ops

0828065

models : add support for wget2 for fedora (#2387)

0653499
unverified

readme : update the path to bench.py (#2386)

57c7a6b
unverified

readme : fix typo (#2383)

16e5a16
unverified

readme : fix broken links in implementation details section (#2382)

4863dee
unverified

whisper : fix compile warning for unused params

0e05e03
unverified

sync : ggml vulkan (ggml/0)

c4c7e49

yolo : add backend support (ggml/924)

630d713

ggml : fix typo in ggml-quants.c comment (ggml/922)

f158bc0

feat: add new `sin` and `cos` operators (ggml/919)

f541d31