ci : fix building workflow for linux/arm64 container (#2555) 37cb027 unverified Raiya Araki commited on Nov 15, 2024
whisper.swiftui : add model download list & bench methods (#2546) 5767578 unverified jhenhong commited on Nov 13, 2024
whisper : fix extra memory usage (#2534) 96efeba unverified Vin Misra vinmisra commited on Nov 6, 2024
whisper : backend registry init before model load 606cb52 ggerganov HF Staff commited on Oct 31, 2024
metal : support permuted matrix multiplicaions (llama/10033) efb86a3 ggerganov HF Staff commited on Oct 25, 2024
CUDA: fix insufficient buffer clearing for MMQ (llama/10032) a41f94c JohannesGaessler commited on Oct 24, 2024
CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021) bcbaad3 JohannesGaessler commited on Oct 24, 2024
Adapt to dynamically loadable backends mechanism (llama/9970) f8d4728 leo-pony commited on Oct 22, 2024
ggml : add asserts for type conversion in fattn kernels (llama/9971) 9542e42 ggerganov HF Staff commited on Oct 21, 2024
fix mul_mat_vec_q and *_vec_q error (llama/9939) 691e6ac Neo Zhang Jianyu arthw commited on Oct 21, 2024
Add SYCL Backend registry, device and Event Interfaces (llama/9705) f35cae5 Ouadie EL FAROUKI commited on Oct 18, 2024
vulkan : add backend registry / device interfaces (llama/9721) df2cb6e Diego Devesa commited on Oct 17, 2024
fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875) cf75979 Gilad S commited on Oct 16, 2024
Vectorize load instructions in dmmv f16 CUDA kernel (llama/9816) ddb0222 agray3 JohannesGaessler commited on Oct 14, 2024
ggml : move more prints to the ggml log system (llama/9839) 98d1a6a Diego Devesa commited on Oct 11, 2024
rpc : add backend registry / device interfaces (llama/9812) 4ac768e Diego Devesa commited on Oct 10, 2024
ggml : add backend registry / device interfaces to BLAS backend (llama/9752) 7f269bb Diego Devesa commited on Oct 7, 2024
ggml : add metal backend registry / device (llama/9713) b6adf19 ggerganov HF Staff slaren commited on Oct 7, 2024
metal : single allocation of encode_async block (llama/9747) 6e1b44c Paul Tsochantaris ggerganov HF Staff commited on Oct 7, 2024
ggml : alloc ggml_contexts on the heap (#2525) 3ccf40a unverified ggerganov HF Staff commited on Oct 31, 2024
ci : fix openblas build (#2511) b116fe7 unverified ggerganov HF Staff Tamotsu Takahashi commited on Oct 30, 2024
scripts : add turbo-q8_0 to the benchmark e923761 unverified ggerganov HF Staff commited on Oct 29, 2024
whisper : move new-segment callback after DTW step (#2515) 6b8369d unverified jettoblack commited on Oct 29, 2024
ruby : support new-segment callback (#2506) ae07b89 unverified KitaitiMakoto commited on Oct 28, 2024
whisper : fix index overflow in token-level timestamp logic (#2505) de79ec1 unverified josscii commited on Oct 23, 2024
readme : update links and make commands (#2489) 3767b95 unverified toboil-features commited on Oct 17, 2024