Commits · natasa365/whisper.cpp

Improve Vulkan shader build system (llama/9239)

9746f77

Markus Tavenrath commited on Sep 6, 2024

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)

d1c244a

compilade commited on Sep 6, 2024

cuda : fix defrag with quantized KV (llama/9319)

061ca37

slaren commited on Sep 5, 2024

ggml : AVX2 support for Q4_0_8_8 (llama/8713)

480ad4d

Srihari-mcw commited on Sep 4, 2024

Fix DMMV dequantization (llama/9279)

aa12d61

Ouadie EL FAROUKI commited on Sep 4, 2024

ggml : add pthread includes on FreeBSD (llama/9258)

d558e0e

yuri@FreeBSD commited on Sep 2, 2024

llama : support RWKV v6 models (llama/8980)

bd4f5ec

mollysama Layl Bongers

compilade

ggerganov commited on Sep 1, 2024

Threadpool: take 2 (llama/8672)

e3e9ca4

Faisal Zaghloul Max Krasnyansky

quic-fzaghlou Max Krasnyansky slaren commited on Aug 29, 2024

vulkan: fix compilation with GGML_VULKAN_DEBUG=ON (ggml/948)

7f60aae

smeso commited on Sep 6, 2024

vulkan: add dryrun support to sin and cos ops (ggml/947)

e2fe267

smeso commited on Sep 6, 2024

vulkan: correctly report support for OP_CONT (ggml/946)

098f7fa

smeso commited on Sep 6, 2024

tests: add gradient tests for all backends (ggml/932)

4751b2f

JohannesGaessler commited on Sep 3, 2024

go : add temperature options (#2417)

5b36f0b
unverified

Binozo Binozo commited on Sep 20, 2024

docker : add libsdl2-dev for container builds (#2424)

aa93432
unverified

JohnnyB commited on Sep 20, 2024

go : add tests and update bindings (#2425)

c80d17a
unverified

Stavros Panakakis commited on Sep 20, 2024

server : use OS-generated temp file name for converted files (#2419)

04d9c8d
unverified

teejae commited on Sep 17, 2024

go : fix CUDA build (#2416)

dafe96d
unverified

Binozo Binozo commited on Sep 15, 2024

cann : add Ascend NPU instructions (#2410)

ae9acd3
unverified

Mimi89757 commited on Sep 11, 2024

cmake: Fix libdir value in pkgconfig file (#2407)

a048ef3
unverified

Philippe Normand commited on Sep 7, 2024

revert : cmake : set MSVC to use UTF-8 on source files (#2346)

5e9ff52

ggerganov commited on Sep 2, 2024

sync : ggml

b13db51

ggerganov commited on Sep 2, 2024

ggml: fix ggml_graph_cpy undefined behavior (ggml/943)

9202e70

JohannesGaessler commited on Aug 31, 2024

cann : fix doxy (ggml/0)

406ac07

ggerganov commited on Aug 28, 2024

vulkan : fix build (llama/0)

e237370

ggerganov commited on Aug 27, 2024

cuda : mark BF16 CONT as unsupported

561bebd

ggerganov commited on Aug 28, 2024

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)

33c59fc

smeso

ggerganov commited on Aug 28, 2024

cmake : set MSVC to use UTF-8 on source files (#2346)

9b3df8e
unverified

Tim Miller commited on Aug 30, 2024

readme : remove invalid flag from Python example (#2396)

5372e8b
unverified

UsernamesLame commited on Aug 30, 2024

readme : fix link (#2394)

ae51c50
unverified

ggerganov commited on Aug 30, 2024

go : add beamsize/entropythold/maxcontext to context interface (#2350)

7efcda7
unverified

hsinhoyeh commited on Aug 28, 2024

talk-llama : sync llama.cpp

4493ffd

ggerganov commited on Aug 28, 2024

whisper : update FA call

2bfec97

ggerganov commited on Aug 28, 2024

sync : ggml

7ba8c97

ggerganov commited on Aug 28, 2024

sync : vulkan (skip) (llama/0)

5fe3dd6

ggerganov commited on Aug 27, 2024

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)

d64f932

slaren commited on Aug 26, 2024

metal : separate scale and mask from QKT in FA kernel (llama/9189)

90cc3cd

ggerganov commited on Aug 26, 2024

ggml : add SSM Metal kernels (llama/8546)

b6e7294

ggerganov commited on Aug 26, 2024

metal : gemma2 flash attention support (llama/9159)

e62fd15

slaren commited on Aug 26, 2024

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)

fb8ae8b

JohannesGaessler commited on Aug 24, 2024

Add a space to supress a cmake warning (llama/9133)

287612e

qnixsynapse commited on Aug 22, 2024

Add oneDNN primitive support (llama/9091)

b4d8c3e

KevinLy commited on Aug 22, 2024

llama : simplify Mamba with advanced batch splits (llama/8526)

f1abcb4

compilade

ggerganov commited on Aug 21, 2024

fallback mmvq (llama/9088)

4b1fda0

hengyu Alberto Cabrera Pérez commited on Aug 20, 2024

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)

5f43886

zhentaoyu commited on Aug 20, 2024

rpc : print error message when failed to connect endpoint (llama/9042)

d54b156

rgerganov commited on Aug 19, 2024

rpc : prevent crashes on invalid input (llama/9040)

656ae00

rgerganov commited on Aug 19, 2024

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047)

e0dc1ad

nicoboss commited on Aug 16, 2024

cmake : remove unused option GGML_CURL (llama/9011)

12634fc

ggerganov commited on Aug 14, 2024

ggml : move rope type enum to ggml.h (llama/8949)

9d45f48

danbev slaren commited on Aug 13, 2024

ggml: fix div-by-zero (llama/9003)

d9ee26f

DavidKorczynski commited on Aug 12, 2024

Commit History

Improve Vulkan shader build system (llama/9239) 9746f77

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151) d1c244a

cuda : fix defrag with quantized KV (llama/9319) 061ca37

ggml : AVX2 support for Q4_0_8_8 (llama/8713) 480ad4d

Fix DMMV dequantization (llama/9279) aa12d61

ggml : add pthread includes on FreeBSD (llama/9258) d558e0e

llama : support RWKV v6 models (llama/8980) bd4f5ec

Threadpool: take 2 (llama/8672) e3e9ca4

vulkan: fix compilation with GGML_VULKAN_DEBUG=ON (ggml/948) 7f60aae

vulkan: add dryrun support to sin and cos ops (ggml/947) e2fe267

vulkan: correctly report support for OP_CONT (ggml/946) 098f7fa

tests: add gradient tests for all backends (ggml/932) 4751b2f

go : add temperature options (#2417) 5b36f0b unverified

docker : add libsdl2-dev for container builds (#2424) aa93432 unverified

go : add tests and update bindings (#2425) c80d17a unverified

server : use OS-generated temp file name for converted files (#2419) 04d9c8d unverified

go : fix CUDA build (#2416) dafe96d unverified

cann : add Ascend NPU instructions (#2410) ae9acd3 unverified

cmake: Fix libdir value in pkgconfig file (#2407) a048ef3 unverified

revert : cmake : set MSVC to use UTF-8 on source files (#2346) 5e9ff52

sync : ggml b13db51

ggml: fix ggml_graph_cpy undefined behavior (ggml/943) 9202e70

cann : fix doxy (ggml/0) 406ac07

vulkan : fix build (llama/0) e237370

cuda : mark BF16 CONT as unsupported 561bebd

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934) 33c59fc

cmake : set MSVC to use UTF-8 on source files (#2346) 9b3df8e unverified

readme : remove invalid flag from Python example (#2396) 5372e8b unverified

readme : fix link (#2394) ae51c50 unverified

go : add beamsize/entropythold/maxcontext to context interface (#2350) 7efcda7 unverified

talk-llama : sync llama.cpp 4493ffd

whisper : update FA call 2bfec97

sync : ggml 7ba8c97

sync : vulkan (skip) (llama/0) 5fe3dd6

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192) d64f932

metal : separate scale and mask from QKT in FA kernel (llama/9189) 90cc3cd

ggml : add SSM Metal kernels (llama/8546) b6e7294

metal : gemma2 flash attention support (llama/9159) e62fd15

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542) fb8ae8b

Add a space to supress a cmake warning (llama/9133) 287612e

Add oneDNN primitive support (llama/9091) b4d8c3e

llama : simplify Mamba with advanced batch splits (llama/8526) f1abcb4

fallback mmvq (llama/9088) 4b1fda0

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052) 5f43886

rpc : print error message when failed to connect endpoint (llama/9042) d54b156

rpc : prevent crashes on invalid input (llama/9040) 656ae00

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047) e0dc1ad

cmake : remove unused option GGML_CURL (llama/9011) 12634fc

ggml : move rope type enum to ggml.h (llama/8949) 9d45f48

ggml: fix div-by-zero (llama/9003) d9ee26f

Improve Vulkan shader build system (llama/9239)

9746f77

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)

d1c244a

cuda : fix defrag with quantized KV (llama/9319)

061ca37

ggml : AVX2 support for Q4_0_8_8 (llama/8713)

480ad4d

Fix DMMV dequantization (llama/9279)

aa12d61

ggml : add pthread includes on FreeBSD (llama/9258)

d558e0e

llama : support RWKV v6 models (llama/8980)

bd4f5ec

Threadpool: take 2 (llama/8672)

e3e9ca4

vulkan: fix compilation with GGML_VULKAN_DEBUG=ON (ggml/948)

7f60aae

vulkan: add dryrun support to sin and cos ops (ggml/947)

e2fe267

vulkan: correctly report support for OP_CONT (ggml/946)

098f7fa

tests: add gradient tests for all backends (ggml/932)

4751b2f

go : add temperature options (#2417)

5b36f0b
unverified

docker : add libsdl2-dev for container builds (#2424)

aa93432
unverified

go : add tests and update bindings (#2425)

c80d17a
unverified

server : use OS-generated temp file name for converted files (#2419)

04d9c8d
unverified

go : fix CUDA build (#2416)

dafe96d
unverified

cann : add Ascend NPU instructions (#2410)

ae9acd3
unverified

cmake: Fix libdir value in pkgconfig file (#2407)

a048ef3
unverified

revert : cmake : set MSVC to use UTF-8 on source files (#2346)

5e9ff52

sync : ggml

b13db51

ggml: fix ggml_graph_cpy undefined behavior (ggml/943)

9202e70

cann : fix doxy (ggml/0)

406ac07

vulkan : fix build (llama/0)

e237370

cuda : mark BF16 CONT as unsupported

561bebd

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)

33c59fc

cmake : set MSVC to use UTF-8 on source files (#2346)

9b3df8e
unverified

readme : remove invalid flag from Python example (#2396)

5372e8b
unverified

readme : fix link (#2394)

ae51c50
unverified

go : add beamsize/entropythold/maxcontext to context interface (#2350)

7efcda7
unverified

talk-llama : sync llama.cpp

4493ffd

whisper : update FA call

2bfec97

sync : ggml

7ba8c97

sync : vulkan (skip) (llama/0)

5fe3dd6

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)

d64f932

metal : separate scale and mask from QKT in FA kernel (llama/9189)

90cc3cd

ggml : add SSM Metal kernels (llama/8546)

b6e7294

metal : gemma2 flash attention support (llama/9159)

e62fd15

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)

fb8ae8b

Add a space to supress a cmake warning (llama/9133)

287612e

Add oneDNN primitive support (llama/9091)

b4d8c3e

llama : simplify Mamba with advanced batch splits (llama/8526)

f1abcb4

fallback mmvq (llama/9088)

4b1fda0

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)

5f43886

rpc : print error message when failed to connect endpoint (llama/9042)

d54b156

rpc : prevent crashes on invalid input (llama/9040)

656ae00

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047)

e0dc1ad

cmake : remove unused option GGML_CURL (llama/9011)

12634fc

ggml : move rope type enum to ggml.h (llama/8949)

9d45f48

ggml: fix div-by-zero (llama/9003)

d9ee26f