openvino : fix convert-whisper-to-openvino.py (#1890) dfd53cc unverified st-gr Stefan Grundmann commited on Feb 22, 2024
main : fix file existence check in main.cpp (#1889) 9162df9 unverified Theldus commited on Feb 22, 2024
ci : enable -Werror for CUDA builds (llama/5579) df03a10 unverified ggerganov commited on Feb 19, 2024
cuda, metal : fix nans in soft_max (llama/5574) 44164ac unverified slaren ggerganov commited on Feb 19, 2024
ggml : android and old glibc NUMA incompatibility bugfixes (llama/5557) 0206c2d unverified bmwl root commited on Feb 19, 2024
ggml : restore vec dot stride arg names (llama/5453) de4041f unverified ggerganov commited on Feb 18, 2024
ci : fix wikitext url + compile warnings (llama/5569) 49f0106 unverified ggerganov commited on Feb 18, 2024
ggml, common, examples, tests : fixed type arguments in printf (llama/5528) 2f3a004 unverified germanaizek commited on Feb 18, 2024
ggml : add ALiBi support for ggml_soft_max_ext (llama/5488) 26c019a unverified ggerganov commited on Feb 19, 2024
ci : add an option to fail on compile warning (llama/3952) b5903fc unverified abastola ggerganov commited on Feb 17, 2024
cmake : fix VULKAN and ROCm builds (llama/5525) ae570e4 unverified ggerganov commited on Feb 16, 2024
ggml : add numa options (llama/5377) 7c952d2 unverified bmwl root Cebtenzzre ggerganov Cebtenzzre commited on Feb 16, 2024
cuda : print message when initialization fails (llama/5512) 1f047ca unverified slaren commited on Feb 15, 2024
vulkan: Find optimal memory type but with fallback (llama/5381) 24e2319 unverified lcfrs commited on Feb 15, 2024
Early return for zero size calls to get_tensor. (llama/5482) f1f5c00 unverified AT ggerganov commited on Feb 13, 2024
ggml-quants : fix compiler warnings (shadow variable) (llama/5472) e538f25 unverified Kawrakow ikawrakow commited on Feb 13, 2024
ggml-sycl: Replace 3d ops with macro (llama/5458) 12970f1 unverified Abhilash Majumder commited on Feb 12, 2024
build : update CBLAS flags + fix unused var warning (#0) 496c0f1 unverified ggerganov commited on Feb 19, 2024
main : check if input files exist before proceeding (#1872) d625238 unverified Theldus commited on Feb 19, 2024
swift : package no longer use ggml dependency (#1861) df6227e unverified ggerganov commited on Feb 12, 2024
ggml-alloc : allocate all leafs as if they were inputs (ggml/731) a512417 unverified slaren commited on Feb 12, 2024
CUDA: mul_mat_vec_q tiling, refactor mul mat logic (llama/5434) c0cfa9b unverified JohannesGaessler slaren commited on Feb 11, 2024
vulkan: only use M-sized matmul on Apple GPUs (llama/5412) 350284e unverified Sergio López commited on Feb 11, 2024
ggml : fix compile warnings (unused vars) (llama/4966) 97fa2e3 unverified ggerganov commited on Feb 11, 2024
ggml : add mmla kernels for quantized GEMM (llama/4966) 0d50a29 unverified snadampal commited on Feb 11, 2024
metal : use autoreleasepool to avoid memory leaks (llama/5437) c276f12 unverified irbull commited on Feb 10, 2024
examples : added audio_ctx argument to main and server (#1857) 469988b unverified dscripka ggerganov commited on Feb 12, 2024
metal : option to embed MSL source into compiled binary (#1842) a46b62a unverified Didzis Gosko commited on Feb 11, 2024
examples : initialize context params properly (#1852) 3443ee7 unverified ggerganov commited on Feb 11, 2024
ggml : fix `error C2078: too many initializers` for MSVC ARM64 (llama/5404) 8ebb36c unverified Michael Podvitskiy commited on Feb 9, 2024
CUDA: more warps for mmvq on NVIDIA (llama/5394) 7ab774c unverified JohannesGaessler commited on Feb 8, 2024