Commit History

talk-llama : sync llama.cpp
7ef5ccc

ggerganov committed on

examples : fix compile warnings [no ci] (#0)
32cfce9

ggerganov committed on

sync : ggml
6ef5667

ggerganov committed on

ggml : sync sycl (skip) (#0)
bf6ccee

ggerganov committed on

scripts : fix sync scripts
e2461ca

ggerganov committed on

ggml : remove unnecessary UNUSED macro call (ggml/880)
ab9a7d0

danbev committed on

cmake : add GGML_BUILD and GGML_SHARED macro definitions (llama/8281)
a8f9bda

KafuuChino committed on

Enabled more data types for oneMKL gemm_batch (llama/8236)
08501f8

Ouadie EL FAROUKI committed on

CUDA: MMQ support for iq4_nl, iq4_xs (llama/8278)
8411e3c

JohannesGaessler committed on

CUDA: revert part of the RDNA1 optimizations (llama/8309)
fcd0c52

Daniele committed on

CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)
04d4209

JohannesGaessler committed on

Fix WARP_SIZE=16 bug of Intel GPU (llama/8266)
1ce11e2

KevinLy committed on

rm get_work_group_size() by local cache for performance (llama/8286)
08fd758

Neo Zhang Jianyu arthw committed on

Define and optimize RDNA1 (llama/8085)
6aa5a89

Daniele committed on

fix typo (llama/8267)
0c9c7c8

Judd Judd committed on

Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (llama/8258)
cc49462

HanClinto committed on

cuda : update supports_op for matrix multiplication (llama/8245)
2314334

slaren committed on

Fix win build conflict of math library (llama/8230)
5a33963

KevinLy committed on

Fix the sub group size of Intel (llama/8106)
2dd429e

KevinLy committed on

CUDA: refactor and optimize IQ MMVQ (llama/8215)
afa1447

JohannesGaessler committed on

Update SYCL-Rope op and Refactor (llama/8157)
06acee2

zhentaoyu committed on

CUDA: fix MMQ stream-k for --split-mode row (llama/8167)
ef3d018

JohannesGaessler committed on

feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854)
025493b

John Balis slaren committed on

ci : disable java build
b5bb445
unverified

ggerganov committed on

server : add inference path to make OAI API compatible (#2270)
66a3eb1
unverified

eschmidbauer committed on

sync : ggml + fix sync script
bce6859
unverified

ggerganov committed on

make : disable CUDA graphs
ab5ee59
unverified

ggerganov committed on

ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (llama/8140)
e83fdad
unverified

slaren committed on

make : disable CUDA mel build
7d13d39
unverified

ggerganov committed on

cmake : minor fixes
369b16c
unverified

ggerganov committed on

make : fix missing -O3
9cccc55
unverified

ggerganov committed on

whisper : disable CUDA mel + fix FFMPEG
2831df8
unverified

ggerganov committed on

sync : ggml
cd6e534
unverified

ggerganov committed on

whisper : reorganize source code + improve CMake (#2256)
f75c2e3
unverified

ggerganov committed on

whisper : optimize fft() function (#2242)
cc603fa
unverified

mky_coder Mike Fan committed on

talk-llama : sync llama.cpp
e8e18fb
unverified

ggerganov committed on

whisper : use ggml_backend_sched (#2239)
bfa5a95

ggerganov slaren committed on

fix : remove extra files
1b0dec0

ggerganov committed on

scripts : sync ggml-blas
463e11c

ggerganov committed on

build : update make / cmake
0b4241c

ggerganov committed on

sync : ggml
89ada87

ggerganov committed on

move BLAS to a separate backend (cont) (llama/6210)
4b26445

slaren committed on

Vulkan Shader Refactor, Memory Debugging Option (llama/7947)
d0120b1

OccamRazor committed on

scripts : stop sync whisper example from ggml
f174613

ggerganov committed on

cmake : fix sycl build (#0)
9a475af

ggerganov committed on

ggml : remove OpenCL (#0)
d303fe3

ggerganov committed on

sycl : sync (#0)
f580c99

ggerganov committed on

cuda : enable CUDA graphs (#0)
d075551

ggerganov committed on

talk-llama : sync llama.cpp
7e268a7

ggerganov committed on

cmake : fix CUDA build (#0)
ddc04a3

ggerganov committed on