ggml : tmp workaround for whisper.cpp (skip) (#2565) ef26f48 unverified ggerganov commited on Nov 16, 2024
whisper.swiftui : switch Mac dest to Mac (Designed for iPad) (#2562) 13f2beb unverified jhenhong commited on Nov 15, 2024
sycl : Fixes to broken builds and test-backend-ops (llama/10257) 9cfb13b Alberto Cabrera Pérez commited on Nov 13, 2024
vulkan: Throttle the number of shader compiles during the build step. (llama/10222) 9677a2f jeffbolznv commited on Nov 11, 2024
vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (llama/10226) 76b8073 jeffbolznv commited on Nov 10, 2024
metal : reorder write loop in mul mat kernel + style (llama/10231) 661360d ggerganov commited on Nov 9, 2024
metal : fix F32 accumulation in FA vec kernel (llama/10232) 228e0b2 ggerganov commited on Nov 9, 2024
ggml: fix zero division in ‘dne’ calculation in CUDA COUNT_EQUAL operator when ‘ne’ is small (#10213) 0ecc4d6 sxx-404 commited on Nov 9, 2024
ggml : optimize llamafile cpu matrix multiplication for ppc64le (llama/10156) 18bdb35 amritahs-ibm commited on Nov 9, 2024
ggml : add ggml-cpu.h to the public headers (llama/10204) 936a35f Diego Devesa commited on Nov 7, 2024
fix q4_0_8_8 format for corrupted tokens issue (llama/10198) 4700b48 snadampal EC2 Default User commited on Nov 7, 2024
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (llama/10133) f58e658 Zhiyuan Li ggerganov Diego Devesa pacominev Yuri Khrustalev Meng, Hengyu commited on Nov 7, 2024
ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (llama/10167) ba20d5c Diego Devesa commited on Nov 4, 2024
cuda : clear error after changing peer access (llama/10153) 106cf6f Diego Devesa commited on Nov 4, 2024
metal : move dequantize templates to beginning of MSL source (llama/0) af0525c ggerganov commited on Nov 4, 2024
ggml : move CPU backend to a separate file (llama/10144) 0f447f2 Diego Devesa commited on Nov 3, 2024
llama : add simple-chat example (llama/10124) 41ff26f Diego Devesa Xuan Son Nguyen commited on Nov 1, 2024
llama : use smart pointers for ggml resources (llama/10117) 6b82135 Diego Devesa commited on Nov 1, 2024
vulkan : improve ggml_vk_create_buffer error handling (llama/9898) 2ce4d02 shupeif commited on Nov 1, 2024
build: fix build error in Windows env with OneAPI setup (llama/10107) e295a3f Zhenwei Jin commited on Nov 1, 2024
llama : fix buffer checks for mamba and rwk (llama/10111) 9df9767 Diego Devesa commited on Oct 31, 2024