Commit History

docker : add libsdl2-dev for container builds (#2424)
aa93432
unverified

JohnnyB commited on

go : add tests and update bindings (#2425)
c80d17a
unverified

Stavros Panakakis commited on

server : use OS-generated temp file name for converted files (#2419)
04d9c8d
unverified

teejae commited on

go : fix CUDA build (#2416)
dafe96d
unverified

Binozo Binozo commited on

cann : add Ascend NPU instructions (#2410)
ae9acd3
unverified

Mimi89757 commited on

cmake: Fix libdir value in pkgconfig file (#2407)
a048ef3
unverified

Philippe Normand commited on

revert : cmake : set MSVC to use UTF-8 on source files (#2346)
5e9ff52

ggerganov commited on

sync : ggml
b13db51

ggerganov commited on

ggml: fix ggml_graph_cpy undefined behavior (ggml/943)
9202e70

JohannesGaessler commited on

cann : fix doxy (ggml/0)
406ac07

ggerganov commited on

vulkan : fix build (llama/0)
e237370

ggerganov commited on

cuda : mark BF16 CONT as unsupported
561bebd

ggerganov commited on

ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)
33c59fc

smeso ggerganov commited on

cmake : set MSVC to use UTF-8 on source files (#2346)
9b3df8e
unverified

Tim Miller commited on

readme : remove invalid flag from Python example (#2396)
5372e8b
unverified

UsernamesLame commited on

readme : fix link (#2394)
ae51c50
unverified

ggerganov commited on

go : add beamsize/entropythold/maxcontext to context interface (#2350)
7efcda7
unverified

hsinhoyeh commited on

talk-llama : sync llama.cpp
4493ffd

ggerganov commited on

whisper : update FA call
2bfec97

ggerganov commited on

sync : ggml
7ba8c97

ggerganov commited on

sync : vulkan (skip) (llama/0)
5fe3dd6

ggerganov commited on

ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)
d64f932

slaren commited on

metal : separate scale and mask from QKT in FA kernel (llama/9189)
90cc3cd

ggerganov commited on

ggml : add SSM Metal kernels (llama/8546)
b6e7294

ggerganov commited on

metal : gemma2 flash attention support (llama/9159)
e62fd15

slaren commited on

CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)
fb8ae8b

JohannesGaessler commited on

Add a space to supress a cmake warning (llama/9133)
287612e

qnixsynapse commited on

Add oneDNN primitive support (llama/9091)
b4d8c3e

KevinLy commited on

llama : simplify Mamba with advanced batch splits (llama/8526)
f1abcb4

compilade ggerganov commited on

fallback mmvq (llama/9088)
4b1fda0

hengyu Alberto Cabrera Pérez commited on

Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)
5f43886

zhentaoyu commited on

rpc : print error message when failed to connect endpoint (llama/9042)
d54b156

rgerganov commited on

rpc : prevent crashes on invalid input (llama/9040)
656ae00

rgerganov commited on

ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047)
e0dc1ad

nicoboss commited on

cmake : remove unused option GGML_CURL (llama/9011)
12634fc

ggerganov commited on

ggml : move rope type enum to ggml.h (llama/8949)
9d45f48

danbev slaren commited on

ggml: fix div-by-zero (llama/9003)
d9ee26f

DavidKorczynski commited on

Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (llama/8943)
11bc9e6

Markus Tavenrath OccamRazor commited on

feat: ref. cross entropy, add CUDA, fix grad test (ggml/929)
e1e87a3

JohannesGaessler commited on

ggml: remove bad assert (ggml/928)
ba483f7

JohannesGaessler commited on

examples: add MNIST training + missing ops
0828065

JohannesGaessler commited on

models : add support for wget2 for fedora (#2387)
0653499
unverified

Brad Murray commited on

readme : update the path to bench.py (#2386)
57c7a6b
unverified

Peng commited on

readme : fix typo (#2383)
16e5a16
unverified

ivoputzer commited on

readme : fix broken links in implementation details section (#2382)
4863dee
unverified

stormofice commited on

whisper : fix compile warning for unused params
0e05e03
unverified

ggerganov commited on

sync : ggml vulkan (ggml/0)
c4c7e49

ggerganov commited on

ggml : fix typo in ggml-quants.c comment (ggml/922)
f158bc0

danbev commited on

feat: add new `sin` and `cos` operators (ggml/919)
f541d31

Ronsor ggerganov commited on