Commits · Xenobd/whisper.cpp

sync : ggml (llama/0)

cdb7964
unverified

ggerganov commited on Jan 30, 2024

Faster AVX2 dot product for IQ2_XS (llama/5187)

187ae44
unverified

Kawrakow

ikawrakow

PeterReid commited on Jan 30, 2024

SOTA 3-bit quants (llama/5196)

4649943
unverified

Kawrakow

ikawrakow commited on Jan 30, 2024

ggml alloc: Fix for null dereference on alloc failure (llama/5200)

8181686
unverified

Paul Tsochantaris commited on Jan 29, 2024

Nomic Vulkan backend (llama/4456)

f5fd92d
unverified

Cebtenzzre niansa

manyoso

apage43 ToKiNoBug

ggerganov slaren commited on Jan 29, 2024

ggml : add max buffer sizes to opencl and metal backends (llama/5181)

3d354d0
unverified

slaren commited on Jan 29, 2024

metal : free metal objects (llama/5161)

ea7167a
unverified

Paul Tsochantaris commited on Jan 28, 2024

gguf : fix comparison (ggml/715)

80cfca4
unverified

ggerganov commited on Jan 29, 2024

`ggml_cuda_cpy` support for 4d tensors and float16->float32 upcasting (ggml/686)

75d438c
unverified

John Balis slaren commited on Jan 29, 2024

gguf : add input validation, prevent integer overflows (ggml/709)

5bf1614
unverified

ggerganov commited on Jan 29, 2024

ci : fix yolo URLs + fix metal capture (ggml/712)

588f789
unverified

ggerganov commited on Jan 29, 2024

metal : add debug capture backend function (ggml/694)

ece88c3
unverified

Jack Mousseau

ggerganov commited on Jan 29, 2024

common : fix wav buffer detection (#1819)

bc84057
unverified

JacobLinCool commited on Jan 30, 2024

server : add fields to `verbose_json` response (#1802)

763d09d
unverified

JacobLinCool commited on Jan 30, 2024

make : update MSYS_NT (#1813)

587152f
unverified

jwijffels commited on Jan 30, 2024

talk-llama : sync llama.cpp

1453539
unverified

ggerganov commited on Jan 28, 2024

sync : ggml

278a9b3
unverified

ggerganov commited on Jan 28, 2024

ggml : add Vulkan backend (llama/2059)

5a97aba
unverified

OccamRazor

SlyEcho Concedo slaren

ggerganov commited on Jan 28, 2024

ggml : add unified SYCL backend for Intel GPUs (llama/2690)

01169e0
unverified

Abhilash Majumder jianyuzh

KevinLy

hengyu

ggerganov commited on Jan 28, 2024

ggml : minor type fix (int64_t -> size_t)

1bbb1a9
unverified

ggerganov commited on Jan 28, 2024

common : fix input buffer check (#1812)

6c38a7f
unverified

ggerganov commited on Jan 27, 2024

talk-llama : sync llama.cpp

92cfd93
unverified

ggerganov commited on Jan 27, 2024

sync : ggml

5a9540e
unverified

ggerganov commited on Jan 27, 2024

Add OpenCL add kernel (llama/5151)

f833987
unverified

OccamRazor commited on Jan 26, 2024

cuda : fix tensor size calculation for non-split buffer (llama/5145)

8f3eb65
unverified

slaren commited on Jan 26, 2024

ggml-alloc : add 10% margin to the buffer sizes (llama/5149)

c55bdf8
unverified

slaren commited on Jan 26, 2024

ggml : update softmax n_task calculation (llama/5126)

3a3eb8e
unverified

snadampal commited on Jan 26, 2024

metal : remove unused `n_buffers` and `buffers` (llama/5129)

a3e87d3
unverified

Paul Tsochantaris commited on Jan 26, 2024

metal : show compile log messages

ae08f31
unverified

ggerganov commited on Jan 25, 2024

cuda : fix 2-bit quants on amd hip (llama/5105)

aadbd67
unverified

Engininja2 commited on Jan 24, 2024

llama : pre-allocate input tensors in a separate buffer (llama/5100)

20a4ca1
unverified

slaren commited on Jan 24, 2024

metal : disable support for MUL_MAT F32 x F16

7fbc01f
unverified

ggerganov commited on Jan 23, 2024

CUDA: more info when no device code (llama/5088)

e96ba7d
unverified

JohannesGaessler commited on Jan 23, 2024

minor : clean-up some warnings and style (llama/5094)

7df090b
unverified

ggerganov commited on Jan 23, 2024

ggml : parallelize FP32 conversion when using BLAS (llama/5045)

7bf2c87
unverified

reinforce20001

ggerganov commited on Jan 22, 2024

llava : MobileVLM support (llama/4954)

dc8f956
unverified

cxt123 Chenxiaotao03 commited on Jan 22, 2024

llama : run all KQV ops on the CPU with no KV offload (llama/5049)

97ce95c
unverified

slaren commited on Jan 20, 2024

cuda : fix compile error in jetson platform (llama/4975)

0935414
unverified

Kylin commited on Jan 20, 2024

ggml : check ggml_add src1 type (ggml/708)

aa5d6ed
unverified

Judd Judd commited on Jan 26, 2024

docs : make model options / model install methods clearer (#1806)

a2bec1d
unverified

mikey-rrr commited on Jan 26, 2024

cmake : make libwhisper.so position independent (#1792)

1cf1553
unverified

trixirt commited on Jan 22, 2024

cmake : temporary remove VLA check (#1795)

1a32e6f
unverified

ggerganov commited on Jan 22, 2024

whisper.android : return output from benchmarks (#1785)

5cff61b
unverified

lcfrs commited on Jan 19, 2024

server : implement "verbose_json" format with token details (#1781)

d6e13b6
unverified

rmmh commited on Jan 18, 2024

ggml : sync ggml-metal.m

b4085c3
unverified

ggerganov commited on Jan 18, 2024

sync : llama.cpp

5de718a
unverified

ggerganov commited on Jan 17, 2024

sync : ggml

34bdd70
unverified

ggerganov commited on Jan 17, 2024

ggml : add IQ2 to test-backend-ops + refactoring (llama/4990)

227f2ae
unverified

ggerganov commited on Jan 17, 2024

imatrix : offload to GPU support (llama/4957)

6490f98
unverified

ggerganov commited on Jan 17, 2024

backend : add eval callback (llama/4935)

3cc64d6
unverified

ggerganov commited on Jan 17, 2024

Commit History

sync : ggml (llama/0) cdb7964 unverified

Faster AVX2 dot product for IQ2_XS (llama/5187) 187ae44 unverified

SOTA 3-bit quants (llama/5196) 4649943 unverified

ggml alloc: Fix for null dereference on alloc failure (llama/5200) 8181686 unverified

Nomic Vulkan backend (llama/4456) f5fd92d unverified

ggml : add max buffer sizes to opencl and metal backends (llama/5181) 3d354d0 unverified

metal : free metal objects (llama/5161) ea7167a unverified

gguf : fix comparison (ggml/715) 80cfca4 unverified

`ggml_cuda_cpy` support for 4d tensors and float16->float32 upcasting (ggml/686) 75d438c unverified

gguf : add input validation, prevent integer overflows (ggml/709) 5bf1614 unverified

ci : fix yolo URLs + fix metal capture (ggml/712) 588f789 unverified

metal : add debug capture backend function (ggml/694) ece88c3 unverified

common : fix wav buffer detection (#1819) bc84057 unverified

server : add fields to `verbose_json` response (#1802) 763d09d unverified

make : update MSYS_NT (#1813) 587152f unverified

talk-llama : sync llama.cpp 1453539 unverified

sync : ggml 278a9b3 unverified

ggml : add Vulkan backend (llama/2059) 5a97aba unverified

ggml : add unified SYCL backend for Intel GPUs (llama/2690) 01169e0 unverified

ggml : minor type fix (int64_t -> size_t) 1bbb1a9 unverified

common : fix input buffer check (#1812) 6c38a7f unverified

talk-llama : sync llama.cpp 92cfd93 unverified

sync : ggml 5a9540e unverified

Add OpenCL add kernel (llama/5151) f833987 unverified

cuda : fix tensor size calculation for non-split buffer (llama/5145) 8f3eb65 unverified

ggml-alloc : add 10% margin to the buffer sizes (llama/5149) c55bdf8 unverified

ggml : update softmax n_task calculation (llama/5126) 3a3eb8e unverified

metal : remove unused `n_buffers` and `buffers` (llama/5129) a3e87d3 unverified

metal : show compile log messages ae08f31 unverified

cuda : fix 2-bit quants on amd hip (llama/5105) aadbd67 unverified

llama : pre-allocate input tensors in a separate buffer (llama/5100) 20a4ca1 unverified

metal : disable support for MUL_MAT F32 x F16 7fbc01f unverified

CUDA: more info when no device code (llama/5088) e96ba7d unverified

minor : clean-up some warnings and style (llama/5094) 7df090b unverified

ggml : parallelize FP32 conversion when using BLAS (llama/5045) 7bf2c87 unverified

llava : MobileVLM support (llama/4954) dc8f956 unverified

llama : run all KQV ops on the CPU with no KV offload (llama/5049) 97ce95c unverified

cuda : fix compile error in jetson platform (llama/4975) 0935414 unverified

ggml : check ggml_add src1 type (ggml/708) aa5d6ed unverified

docs : make model options / model install methods clearer (#1806) a2bec1d unverified

cmake : make libwhisper.so position independent (#1792) 1cf1553 unverified

cmake : temporary remove VLA check (#1795) 1a32e6f unverified

whisper.android : return output from benchmarks (#1785) 5cff61b unverified

server : implement "verbose_json" format with token details (#1781) d6e13b6 unverified

ggml : sync ggml-metal.m b4085c3 unverified

sync : llama.cpp 5de718a unverified

sync : ggml 34bdd70 unverified

ggml : add IQ2 to test-backend-ops + refactoring (llama/4990) 227f2ae unverified

imatrix : offload to GPU support (llama/4957) 6490f98 unverified

backend : add eval callback (llama/4935) 3cc64d6 unverified

sync : ggml (llama/0)

cdb7964
unverified

Faster AVX2 dot product for IQ2_XS (llama/5187)

187ae44
unverified

SOTA 3-bit quants (llama/5196)

4649943
unverified

ggml alloc: Fix for null dereference on alloc failure (llama/5200)

8181686
unverified

Nomic Vulkan backend (llama/4456)

f5fd92d
unverified

ggml : add max buffer sizes to opencl and metal backends (llama/5181)

3d354d0
unverified

metal : free metal objects (llama/5161)

ea7167a
unverified

gguf : fix comparison (ggml/715)

80cfca4
unverified

`ggml_cuda_cpy` support for 4d tensors and float16->float32 upcasting (ggml/686)

75d438c
unverified

gguf : add input validation, prevent integer overflows (ggml/709)

5bf1614
unverified

ci : fix yolo URLs + fix metal capture (ggml/712)

588f789
unverified

metal : add debug capture backend function (ggml/694)

ece88c3
unverified

common : fix wav buffer detection (#1819)

bc84057
unverified

server : add fields to `verbose_json` response (#1802)

763d09d
unverified

make : update MSYS_NT (#1813)

587152f
unverified

talk-llama : sync llama.cpp

1453539
unverified

sync : ggml

278a9b3
unverified

ggml : add Vulkan backend (llama/2059)

5a97aba
unverified

ggml : add unified SYCL backend for Intel GPUs (llama/2690)

01169e0
unverified

ggml : minor type fix (int64_t -> size_t)

1bbb1a9
unverified

common : fix input buffer check (#1812)

6c38a7f
unverified

talk-llama : sync llama.cpp

92cfd93
unverified

sync : ggml

5a9540e
unverified

Add OpenCL add kernel (llama/5151)

f833987
unverified

cuda : fix tensor size calculation for non-split buffer (llama/5145)

8f3eb65
unverified

ggml-alloc : add 10% margin to the buffer sizes (llama/5149)

c55bdf8
unverified

ggml : update softmax n_task calculation (llama/5126)

3a3eb8e
unverified

metal : remove unused `n_buffers` and `buffers` (llama/5129)

a3e87d3
unverified

metal : show compile log messages

ae08f31
unverified

cuda : fix 2-bit quants on amd hip (llama/5105)

aadbd67
unverified

llama : pre-allocate input tensors in a separate buffer (llama/5100)

20a4ca1
unverified

metal : disable support for MUL_MAT F32 x F16

7fbc01f
unverified

CUDA: more info when no device code (llama/5088)

e96ba7d
unverified

minor : clean-up some warnings and style (llama/5094)

7df090b
unverified

ggml : parallelize FP32 conversion when using BLAS (llama/5045)

7bf2c87
unverified

llava : MobileVLM support (llama/4954)

dc8f956
unverified

llama : run all KQV ops on the CPU with no KV offload (llama/5049)

97ce95c
unverified

cuda : fix compile error in jetson platform (llama/4975)

0935414
unverified

ggml : check ggml_add src1 type (ggml/708)

aa5d6ed
unverified

docs : make model options / model install methods clearer (#1806)

a2bec1d
unverified

cmake : make libwhisper.so position independent (#1792)

1cf1553
unverified

cmake : temporary remove VLA check (#1795)

1a32e6f
unverified

whisper.android : return output from benchmarks (#1785)

5cff61b
unverified

server : implement "verbose_json" format with token details (#1781)

d6e13b6
unverified

ggml : sync ggml-metal.m

b4085c3
unverified

sync : llama.cpp

5de718a
unverified

sync : ggml

34bdd70
unverified

ggml : add IQ2 to test-backend-ops + refactoring (llama/4990)

227f2ae
unverified

imatrix : offload to GPU support (llama/4957)

6490f98
unverified

backend : add eval callback (llama/4935)

3cc64d6
unverified