Commit History

whisper : fix with_state methods to use the correct state (#1519)
2b4a95c
unverified

sandrohanea Sandro Hanea committed on

whisper : fix overriding the audio context
d717a20
unverified

ggerganov HF Staff committed on

cuda : assert ggml_add sources to be contiguous
c012035
unverified

ggerganov HF Staff committed on

ios : sync submodule
c0b01ea
unverified

ggerganov HF Staff committed on

sync : ggml (ggml-alloc + linker + gguf fixes) (#1501)
58507b9
unverified

ggerganov HF Staff committed on

quantize : add support for K-quant types
91ed6bd
unverified

ggerganov HF Staff committed on

bench : fix memcpy bench size
97246e8
unverified

ggerganov HF Staff committed on

talk-llama : improve quote and backtick handling (#1364)
fa6a8a8
unverified

Sam Pullara committed on

talk-llama : enable GPU by default
afd6523
unverified

ggerganov HF Staff committed on

models : add info about distilled models
8c251e2
unverified

ggerganov HF Staff committed on

release : v1.5.0
65d366a
unverified

ggerganov HF Staff committed on

bench-all : add distil models
09d2496
unverified

ggerganov HF Staff committed on

js : latest whisper.js
4b2fe60
unverified

ggerganov HF Staff committed on

bench-all : indentations
92df795
unverified

ggerganov HF Staff committed on

whisper : make large version explicit + fix data size units (#1493)
03a3210
unverified

ggerganov HF Staff committed on

java : fix test (#1492)
d6276a7
unverified

ggerganov HF Staff committed on

whisper : add batched decoding (#1486)
0131aa6
unverified

ggerganov HF Staff committed on

java : use tiny.en for tests (#1484)
d03f526
unverified

ggerganov HF Staff committed on

whisper : add grammar-based sampling (#1229)
46f0c56
unverified

evanqjones ggerganov HF Staff committed on

talk-llama : add n_gpu_layers parameter (#1475)
aa7c2e9
unverified

TheJCDenton committed on

examples : add whisper.android.java for compatibility with older Android versions using Java (#1382)
bb48f57
unverified

Tong Li litongmacos bobqianic committed on

readme : update comment about source code
730c424
unverified

ggerganov HF Staff committed on

ggml : fix some compile warnings
ad6c9c1
unverified

ggerganov HF Staff committed on

readme : update GPU / CUDA
e8ad308
unverified

ggerganov HF Staff committed on

whisper : add full CUDA and Metal offloading (#1472)
da4acca
unverified

ggerganov HF Staff committed on

whisper : return with error from whisper_encode_internal and whisper_decode_internal when abort callback is true (#1456)
ca0dc29
unverified

Ben Nortier bjnortier committed on

talk-llama : add language auto detect (#1467)
cfc50d3
unverified

Jakub Ráček ggerganov HF Staff committed on

openvino : update convert-whisper-to-openvino.py to support v3 (#1459)
b9f3d1b
unverified

bobqianic committed on

coreml : use the correct `n_mel` value (#1458)
0220892
unverified

mljxy committed on

whisper : reset mel time when resetting timings (#1452)
8f21423
unverified

Ben Nortier bjnortier committed on

ios : add support for Swift Package Manager (#1370)
3fa9a6f
unverified

sindresorhus committed on

release : v1.4.3
4d80650
unverified

ggerganov HF Staff committed on

ggml : fix MIN / MAX macro re-definition
1344fc4
unverified

ggerganov HF Staff committed on

whisper : add support for large v3 (#1444)
9908abb
unverified

ggerganov HF Staff committed on

android : decouple example into a library and app module (#1445)
9f860c0
unverified

tobrun committed on

whisper : reset ctx->t_start_us when calling whisper_reset_timings() (#1434)
f0f722d
unverified

Ben Nortier bjnortier committed on

cmake : fix talk-llama build
31ebf9f
unverified

ggerganov HF Staff committed on

metal : fix asserts for setThreadgroupMemoryLength (close #1435)
b42b45f
unverified

ggerganov HF Staff committed on

ci : fix variable names in GitHub actions config (#1440)
66cb760
unverified

iamthad committed on

talk-llama : fix n_gpu_layers usage again (#1442)
37d6862
unverified

jhenhong committed on

whisper : add missing abort callback initializers
a94a8ce
unverified

ggerganov HF Staff committed on

examples : fix n_gpu_layers usage in talk-llama (#1441)
e0ea7d1
unverified

jhenhong committed on

whisper : add context param to disable gpu (#1293)
290abed
unverified

jhenhong ggerganov HF Staff committed on

whisper : add support for new distilled Whisper models (#1424)
a570c92
unverified

ggerganov HF Staff committed on

cuda : fix HIPBLAS build
46033e6
unverified

ggerganov HF Staff committed on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov HF Staff Chris Raethke committed on

models : use absolute paths for the converted model (#1356)
6023f2d
unverified

bobqianic committed on

talk-llama : move up-to-date demo to top (#1417)
060e781
unverified

asadm committed on

talk-llama : add an up-to-date demo video
b41f03a
unverified

ggerganov HF Staff committed on

examples : Implement JSON output for Token-Level data in main (#1358)
d166741
unverified

akx committed on