Commit History

ggml/examples: add backend support for numerical optimization (ggml/949)
5c178b0

JohannesGaessler ggerganov slaren commited on

examples : add null threadpool args where needed (ggml/0)
0bb7364

ggerganov commited on

metal : update support condition for im2col + fix warning (llama/0)
ed9150a

ggerganov commited on

ggml : always check bounds on get_rows operations (llama/9354)
a13c99b

slaren commited on

ggml : fix missing `cpu_set_t` on emscripten (llama/9336)
d8c76ac

Xuan Son Nguyen commited on

Improve Vulkan shader build system (llama/9239)
9746f77

Markus Tavenrath commited on

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)
d1c244a

compilade commited on

cuda : fix defrag with quantized KV (llama/9319)
061ca37

slaren commited on

ggml : AVX2 support for Q4_0_8_8 (llama/8713)
480ad4d

Srihari-mcw commited on

Fix DMMV dequantization (llama/9279)
aa12d61

Ouadie EL FAROUKI commited on

ggml : add pthread includes on FreeBSD (llama/9258)
d558e0e

yuri@FreeBSD commited on

Threadpool: take 2 (llama/8672)
e3e9ca4

Faisal Zaghloul Max Krasnyansky quic-fzaghlou Max Krasnyansky slaren commited on

vulkan: fix compilation with GGML_VULKAN_DEBUG=ON (ggml/948)
7f60aae

smeso commited on

vulkan: add dryrun support to sin and cos ops (ggml/947)
e2fe267

smeso commited on

vulkan: correctly report support for OP_CONT (ggml/946)
098f7fa

smeso commited on

tests: add gradient tests for all backends (ggml/932)
4751b2f

JohannesGaessler commited on

go : add temperature options (#2417)
5b36f0b
unverified

Binozo Binozo commited on

docker : add libsdl2-dev for container builds (#2424)
aa93432
unverified

JohnnyB commited on

go : add tests and update bindings (#2425)
c80d17a
unverified

Stavros Panakakis commited on

server : use OS-generated temp file name for converted files (#2419)
04d9c8d
unverified

teejae commited on

go : fix CUDA build (#2416)
dafe96d
unverified

Binozo Binozo commited on

cann : add Ascend NPU instructions (#2410)
ae9acd3
unverified

Mimi89757 commited on

cmake: Fix libdir value in pkgconfig file (#2407)
a048ef3
unverified

Philippe Normand commited on

revert : cmake : set MSVC to use UTF-8 on source files (#2346)
5e9ff52

ggerganov commited on

sync : ggml
b13db51

ggerganov commited on

ggml: fix ggml_graph_cpy undefined behavior (ggml/943)
9202e70

JohannesGaessler commited on

cann : fix doxy (ggml/0)
406ac07

ggerganov commited on

vulkan : fix build (llama/0)
e237370

ggerganov commited on

cuda : mark BF16 CONT as unsupported
561bebd

ggerganov commited on

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)
33c59fc

smeso ggerganov commited on

cmake : set MSVC to use UTF-8 on source files (#2346)
9b3df8e
unverified

Tim Miller commited on

readme : remove invalid flag from Python example (#2396)
5372e8b
unverified

UsernamesLame commited on

readme : fix link (#2394)
ae51c50
unverified

ggerganov commited on

go : add beamsize/entropythold/maxcontext to context interface (#2350)
7efcda7
unverified

hsinhoyeh commited on

talk-llama : sync llama.cpp
4493ffd

ggerganov commited on

whisper : update FA call
2bfec97

ggerganov commited on

sync : ggml
7ba8c97

ggerganov commited on

sync : vulkan (skip) (llama/0)
5fe3dd6

ggerganov commited on

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)
d64f932

slaren commited on

metal : separate scale and mask from QKT in FA kernel (llama/9189)
90cc3cd

ggerganov commited on

ggml : add SSM Metal kernels (llama/8546)
b6e7294

ggerganov commited on

metal : gemma2 flash attention support (llama/9159)
e62fd15

slaren commited on

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)
fb8ae8b

JohannesGaessler commited on

Add a space to supress a cmake warning (llama/9133)
287612e

qnixsynapse commited on

Add oneDNN primitive support (llama/9091)
b4d8c3e

KevinLy commited on

llama : simplify Mamba with advanced batch splits (llama/8526)
f1abcb4

compilade ggerganov commited on

fallback mmvq (llama/9088)
4b1fda0

hengyu Alberto Cabrera Pérez commited on

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)
5f43886

zhentaoyu commited on

rpc : print error message when failed to connect endpoint (llama/9042)
d54b156

rgerganov commited on