whisper.cpp / ggml /include /ggml-backend.h

Commit History

vulkan: Add fusion support for RMS_NORM+MUL (llama/14366)
737f12d

jeffbolznv slaren commited on

Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (llama/13386)
418769d

David Huang commited on

CUDA: fix logic for clearing padding with -ngl 0 (llama/13320)
c3e51a2

JohannesGaessler commited on

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852

William Tambellini slaren commited on

rpc : early register backend devices (llama/11262)
4134077

rgerganov commited on

ggml: load all backends from a user-provided search path (llama/10699)
c6de218

Gilad S Diego Devesa commited on

ggml : add support for dynamic loading of backends (llama/10469)
b73266f

Diego Devesa ggerganov commited on

ggml: new optimization interface (ggml/988)
dd33ace

JohannesGaessler commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov R0CKSTAR commited on

ggml : move CPU backend to a separate file (llama/10144)
0f447f2

Diego Devesa commited on

llama : refactor model loader with backend registry (llama/10026)
582a21e

Diego Devesa commited on

ggml : add backend registry / device interfaces to BLAS backend (llama/9752)
7f269bb

Diego Devesa commited on

ggml : add metal backend registry / device (llama/9713)
b6adf19

ggerganov slaren commited on

ggml-backend : add device and backend reg interfaces (llama/9707)
9d74d85

Diego Devesa commited on

ggml-backend : add device and backend reg interfaces (llama/9707)
1bdb50a

Diego Devesa JohannesGaessler commited on

ggml: refactor cross entropy loss CPU impl. (ggml/976)
2a0805f

JohannesGaessler commited on

ggml/examples: add backend support for numerical optimization (ggml/949)
5c178b0

JohannesGaessler ggerganov slaren commited on

Threadpool: take 2 (llama/8672)
e3e9ca4

Faisal Zaghloul Max Krasnyansky quic-fzaghlou Max Krasnyansky slaren commited on

feat: ref. cross entropy, add CUDA, fix grad test (ggml/929)
e1e87a3

JohannesGaessler commited on

CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572)
afc137c

JohannesGaessler commited on

whisper : reorganize source code + improve CMake (#2256)
f75c2e3
unverified

ggerganov commited on