ggml : remove ggml_cpy_inplace and ggml_cont_inplace (ggml/693) 6469bfe unverified Timothy Cronin commited on Jan 11, 2024
metal : wrap each operation in debug group (ggml/690) b5e360f unverified Jack Mousseau commited on Jan 10, 2024
ggml : change GGML_MAX_NAME at compile time (ggml/682) ded2b1a unverified leejet commited on Jan 10, 2024
CUDA: fixed redundant value dequantization (llama/4809) 70c8d60 unverified JohannesGaessler commited on Jan 7, 2024
ggml : use __builtin_amdgcn_sudot4 in __dp4a for gfx11 (llama/4787) f391d7a unverified Konstantin Zhuravlyov commited on Jan 7, 2024
ggml : do not sched_yield when calling BLAS (llama/4761) 5d1dffc unverified ggerganov commited on Jan 5, 2024
ggml : include stdlib.h before intrin.h (llama/4736) 743cace unverified ggerganov commited on Jan 4, 2024
swift : checkout ggml commit instead of branch (#1750) 6ab88cc unverified Alexandru Mariuti commited on Jan 10, 2024
talk-llama : add optional Piper TTS support (#1749) fb92e62 unverified RhinoDevel commited on Jan 10, 2024
main : add cli option to disable system prints (#1740) 97e710a unverified ggerganov commited on Jan 8, 2024
server : fix server temperature + add temperature_inc (#1729) 8a648fc unverified ggerganov commited on Jan 7, 2024
fix : cuda order of synchronization when setting a buffer (ggml/679) e48c553 unverified Erik Scholz slaren commited on Jan 5, 2024
metal : switch back to default.metallib (ggml/681) b945a8f unverified ggerganov commited on Jan 5, 2024
swift : update Package.swift to use ggml as package dependency (#1701) 77f731f unverified 1-ashraful-islam commited on Jan 3, 2024
ggml : add error handling to graph_compute (#1714) 92f24ee unverified finnvoorhees commited on Jan 3, 2024
metal : optimize ggml_mul_mat_id (faster Mixtral PP) (llama/4725) 8bc6274 ggerganov commited on Jan 2, 2024
metal : enable shader debugging (cmake option) (llama/4705) 7dd37dc ggerganov commited on Jan 2, 2024
CUDA: fixed tensor cores not being used on RDNA3 (llama/4697) 654d245 JohannesGaessler commited on Dec 30, 2023
CUDA: fix tensor core logic for Pascal and HIP (llama/4682) 977baeb JohannesGaessler commited on Dec 29, 2023
ggml : extend ggml_get_rows, ggml_repeat, ggml_concat (ggml/639) f17d170 Guillaume Wenzek ggerganov commited on Dec 29, 2023
docker : fix the publishing of the CUDA Docker image (#1704) 6091193 unverified bobqianic commited on Dec 30, 2023
ci : build with CLBlast + ggml-opencl use GGML_API (#1576) 41a13d4 unverified Tamotsu Takahashi commited on Dec 29, 2023
whisper : replace `tensor->n_dims` with `ggml_n_dims(tensor)` (#1694) cee2822 unverified bobqianic commited on Dec 29, 2023
sync : ggml (VMM, sync-ggml-am, dotprod ARM fixes, CUDA fixes) (#1691) 919a447 unverified ggerganov commited on Dec 29, 2023
whisper : Replace WHISPER_PRINT_DEBUG with WHISPER_LOG_DEBUG (#1681) 5ad04c9 unverified bobqianic commited on Dec 23, 2023
sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677) aa86ade unverified ggerganov commited on Dec 22, 2023
CI : Add coverage for talk-llama when WHISPER_CUBLAS=1 (#1672) 983e4bd unverified bobqianic commited on Dec 21, 2023
examples : Revert CMakeLists.txt for talk-llama (#1669) 92a92ed unverified bobqianic commited on Dec 21, 2023