fix : cuda order of synchronization when setting a buffer (ggml/679) e48c553 unverified Erik Scholz slaren commited on Jan 5, 2024
metal : switch back to default.metallib (ggml/681) b945a8f unverified ggerganov commited on Jan 5, 2024
sync : ggml (VMM, sync-ggml-am, dotprod ARM fixes, CUDA fixes) (#1691) 919a447 unverified ggerganov commited on Dec 29, 2023
sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677) aa86ade unverified ggerganov commited on Dec 22, 2023
bench.py : add different large models (#1655) 282c3a3 unverified Alfredo Montesinos commited on Dec 19, 2023
whisper : make large version explicit + fix data size units (#1493) 03a3210 unverified ggerganov commited on Nov 15, 2023
whisper : add full CUDA and Metal offloading (#1472) da4acca unverified ggerganov commited on Nov 12, 2023
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 7006035 unverified ggerganov Chris Raethke commited on Nov 3, 2023
extra: Add benchmark script implemented in Python (#1298) c587102 unverified Neil Chudleigh commited on Sep 25, 2023
extra : update 'quantize-all.sh' to quantize all downloaded models (#1054) b2215ea unverified thefinaldegree commited on Jun 28, 2023
whisper : add integer quantization support (#540) a5f8f3c unverified ggerganov commited on Apr 30, 2023
bench-wts.sh : rename script + add execute permission f0a2b23 unverified ggerganov commited on Mar 6, 2023
qual-bench.sh : add quality comparison tool, and update main.cpp to allow using a font file (#569) adb49fb unverified venkr commited on Mar 6, 2023
bench : more concise representation of the results (#89) a9f3ce0 unverified ggerganov commited on Dec 11, 2022
bench.wasm : same as "bench" but runs in the browser (#89) 68dae1f unverified ggerganov commited on Dec 11, 2022
models : add the new "large" model release by OpenAI 793fa90 unverified ggerganov commited on Dec 6, 2022
command.wasm : add voice assistant example for the Web (#171) 2ee248a unverified ggerganov commited on Nov 26, 2022