cmake : add library versioning (#1352) 3ab7ee7 unverified kennethge ggerganov commited on Mar 4, 2024
readme : recommend MacOS Sonoma for Core ML (#1917) c4e849b unverified Gavin Cai commited on Mar 4, 2024
ggml : make i-quants work with super-blocks of 64 (CPU,Metal) (llama/5760) 9a07f42 unverified Kawrakow ikawrakow commited on Feb 28, 2024
Attempt to fix android build (llama/5752) e720b3b unverified Kawrakow ikawrakow commited on Feb 27, 2024
IQ4_XS: a 4.25 bpw quantization (llama/5747) 0ee1bfb unverified Kawrakow ikawrakow commited on Feb 27, 2024
cuda : replace remaining shfl_xor with calls to warp_reduce functions (llama/5744) 753b30d unverified Engininja2 commited on Feb 27, 2024
ggml-quants : fix avx2 iq1_s vec_dot when compiled with gcc (llama/5742) 72e8610 unverified Engininja2 commited on Feb 27, 2024
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama/5721) 2b9bb9e unverified Kawrakow ikawrakow ggerganov commited on Feb 26, 2024
CUDA: fix DEBUG_CUDA_MALLOC (llama/5729) f18f386 unverified JohannesGaessler commited on Feb 26, 2024
Add support for soft_max ALiBi (llama/5639) 86d6a5e unverified AidanBeltonS Abhilash Majumder commited on Feb 26, 2024
ggml-quants : provide ggml_vqtbl1q_u8 for 64bit compatibility (llama/5711) 430efc6 unverified Crad commited on Feb 25, 2024
stream.wasm : fix invalid memory access when no segments (#1902) 3273767 unverified Andrew S commited on Feb 26, 2024
IQ3_S: a much better alternative to Q3_K (llama/5676) 32589c9 unverified Kawrakow ikawrakow commited on Feb 24, 2024
talk, talk-llama : pass text_to_speak as a file (#1865) 3fd8b4d unverified Tamotsu Takahashi commited on Feb 24, 2024
ggml : always define ggml_fp16_t as uint16_t (llama/5666) bc567d3 unverified ggerganov commited on Feb 22, 2024
Update ggml_sycl_op_mul_mat_vec_q (llama/5502) 963ffd5 unverified AidanBeltonS Abhilash Majumder commited on Feb 20, 2024
Refactor validation and enumeration platform checks into functions to clean up ggml_vk_instance_init() 8637c17 unverified OccamRazor commited on Feb 14, 2024
Add check for VK_KHR_portability_enumeration for MoltenVK support 85caa3f unverified OccamRazor commited on Feb 10, 2024
Resolve ErrorIncompatibleDriver with Vulkan on MacOS. 0bc3433 unverified dokterbob commited on Feb 3, 2024
cuda : ignore peer access already enabled errors (llama/5597) a817d85 unverified slaren commited on Feb 19, 2024
ggml : compute forward no longer pass src tensors (ggml/729) 4e31c82 unverified Siddharth Ramakrishnan siddharthvader commited on Feb 21, 2024
openvino : fix convert-whisper-to-openvino.py (#1890) dfd53cc unverified st-gr Stefan Grundmann commited on Feb 22, 2024
main : fix file existence check in main.cpp (#1889) 9162df9 unverified Theldus commited on Feb 22, 2024
ci : enable -Werror for CUDA builds (llama/5579) df03a10 unverified ggerganov commited on Feb 19, 2024