Commit History

scripts : update sync
cbec24b

ggerganov committed on

ci : fix building workflow for linux/arm64 container (#2555)
37cb027
unverified

Raiya Araki committed on

ruby : extend API (#2551)
82ceab0
unverified

KitaitiMakoto committed on

whisper.swiftui : add model download list & bench methods (#2546)
5767578
unverified

jhenhong committed on

ruby : fix the instructions (#2548)
3c4a8bb
unverified

wilsonsilva committed on

ggml : vulkan logs (#2547)
aec2a89
unverified

thewh1teagle committed on

examples : fix ffmpeg v5 build (#2543)
88b342e
unverified

Stefan Sydow committed on

whisper : fix extra memory usage (#2534)
96efeba
unverified

Vin Misra vinmisra committed on

whisper : backend registry init before model load
606cb52

ggerganov committed on

talk-llama : sync llama.cpp
8177527

ggerganov committed on

sync : ggml
80b373b

ggerganov committed on

ggml : add AMX backend (llama/8998)
1152a79

mingfeima committed on

metal : support permuted matrix multiplications (llama/10033)
efb86a3

ggerganov committed on

CUDA: fix insufficient buffer clearing for MMQ (llama/10032)
a41f94c

JohannesGaessler committed on

CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021)
bcbaad3

JohannesGaessler committed on

increase cuda_cpy block size (ggml/996)
60f512e

bssrdf bssrdf committed on

metal : add POOL2D and fix IM2COL (llama/9943)
b553b89

newfrisbie committed on

Adapt to dynamically loadable backends mechanism (llama/9970)
f8d4728

leo-pony committed on

ggml : add asserts for type conversion in fattn kernels (llama/9971)
9542e42

ggerganov committed on

rpc : pack only RPC structs (llama/9959)
6bdbd69

rgerganov committed on

fix mul_mat_vec_q and *_vec_q error (llama/9939)
691e6ac

Neo Zhang Jianyu arthw committed on

rpc : backend refactoring (llama/9912)
b6c412f

rgerganov committed on

Add SYCL Backend registry, device and Event Interfaces (llama/9705)
f35cae5

Ouadie EL FAROUKI committed on

add amx kernel for gemm (llama/8998)
db52137

mingfeima committed on

vulkan : add backend registry / device interfaces (llama/9721)
df2cb6e

Diego Devesa committed on

fix: allocating CPU buffer with size `0` (llama/9917)
ae9a15f

Gilad S committed on

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)
cf75979

Gilad S committed on

CUDA: fix 1D im2col, add tests (ggml/993)
c24f7b1

JohannesGaessler committed on

Fix cann compilation error (llama/9891)
b480790

leo-pony committed on

Vectorize load instructions in dmmv f16 CUDA kernel (llama/9816)
ddb0222

agray3 JohannesGaessler committed on

ggml : move more prints to the ggml log system (llama/9839)
98d1a6a

Diego Devesa committed on

rpc : add backend registry / device interfaces (llama/9812)
4ac768e

Diego Devesa committed on

musa: add docker image support (llama/9685)
553b278

R0CKSTAR committed on

ggml : fix BLAS with unsupported types (llama/9775)
0a93e1b

Diego Devesa committed on

ggml : add backend registry / device interfaces to BLAS backend (llama/9752)
7f269bb

Diego Devesa committed on

Update building for Android (llama/9672)
27e2fca

Andrew Minh Nguyen committed on

ggml : add metal backend registry / device (llama/9713)
b6adf19

ggerganov slaren committed on

metal : single allocation of encode_async block (llama/9747)
6e1b44c

Paul Tsochantaris ggerganov committed on

ggml-alloc : remove buffer_id from leaf_alloc (ggml/987)
1a776cc

danbev committed on

scripts : sync amx
f819a74
unverified

ggerganov committed on

ggml : alloc ggml_contexts on the heap (#2525)
3ccf40a
unverified

ggerganov committed on

ci : fix openblas build (#2511)
b116fe7
unverified

ggerganov Tamotsu Takahashi committed on

scripts : add turbo-q8_0 to the benchmark
e923761
unverified

ggerganov committed on

whisper : minor compile warning
c06d5f9
unverified

ggerganov committed on

whisper : move new-segment callback after DTW step (#2515)
6b8369d
unverified

jettoblack committed on

ruby : fix installation test (#2519)
403eb98
unverified

KitaitiMakoto committed on

ruby : add more APIs (#2518)
8aaba9a
unverified

KitaitiMakoto committed on

ruby : support new-segment callback (#2506)
ae07b89
unverified

KitaitiMakoto committed on

ruby : add Metal support (#2516)
bb2a01d
unverified

KitaitiMakoto committed on

whisper : fix index overflow in token-level timestamp logic (#2505)
de79ec1
unverified

josscii committed on