whisper.cpp / ggml /src /ggml-backend.cpp

Commit History

ggml : fix fallback to CPU for ununsupported ops (llama/15118)
2b7ae5e

Diego Devesa commited on

sched : fix multiple evaluations of the same graph with pipeline parallelism (llama/14855)
e9f5612

Diego Devesa commited on

metal : fuse add, mul + add tests (llama/14596)
66ae493

ggerganov commited on

vulkan: Add fusion support for RMS_NORM+MUL (llama/14366)
737f12d

jeffbolznv slaren commited on

sched : avoid changing cur_copy when a graph is already allocated (llama/13922)
1c0a5c0

Diego Devesa commited on

ggml : allow CUDA graphs when using pipeline parallelism (llama/13814)
b85e3c0

Diego Devesa commited on

llama/ggml: add LLM training support (llama/10544)
8d3b3c1

JohannesGaessler commited on

Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (llama/13386)
418769d

David Huang commited on

CUDA: fix logic for clearing padding with -ngl 0 (llama/13320)
c3e51a2

JohannesGaessler commited on

ggml : portability fixes for VS 2017 (llama/12150)
49e3343

mgroeber9110 Marcus Groeber commited on

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852

William Tambellini slaren commited on

ggml-backend : only offload from host buffers (fix) (llama/11124)
9ac3c7e

Diego Devesa commited on

ggml-backend : only offload from host buffers (llama/11120)
1ca87a8

Diego Devesa commited on

ggml : improve inputs log sched_print_assignments (ggml/1053)
4427ede

danbev commited on

ggml : move AMX to the CPU backend (llama/10570)
3732429

Diego Devesa commited on

ggml-opt: fix data corruption (ggml/1022)
a916e92

JohannesGaessler commited on

ggml/sched : do not skip views in pre-assignments
b1eba61

slaren commited on

ggml : sync resolve (skip) (#0)
d4d67dc

ggerganov commited on

llama : only use default buffer types for the KV cache (llama/10358)
9e9c0ad

Diego Devesa commited on

ggml : fix possible buffer use after free in sched reserve (llama/9930)
4703ea3

Diego Devesa commited on

ggml: new optimization interface (ggml/988)
dd33ace

JohannesGaessler commited on

ggml : tmp workaround for whisper.cpp (skip) (#2565)
ef26f48
unverified

ggerganov commited on

ggml : move CPU backend to a separate file (llama/10144)
0f447f2

Diego Devesa commited on

llama : fix buffer checks for mamba and rwk (llama/10111)
9df9767

Diego Devesa commited on

kompute: add backend registry / device interfaces (llama/10045)
b612415

slpnix commited on

llama : refactor model loader with backend registry (llama/10026)
582a21e

Diego Devesa commited on

Adapt to dynamically loadable backends mechanism (llama/9970)
f8d4728

leo-pony commited on

Add SYCL Backend registry, device and Event Interfaces (llama/9705)
f35cae5

Ouadie EL FAROUKI commited on

add amx kernel for gemm (llama/8998)
db52137

mingfeima commited on

vulkan : add backend registry / device interfaces (llama/9721)
df2cb6e

Diego Devesa commited on

fix: allocating CPU buffer with size `0` (llama/9917)
ae9a15f

Gilad S commited on

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)
cf75979

Gilad S commited on

ggml : move more prints to the ggml log system (llama/9839)
98d1a6a

Diego Devesa commited on

rpc : add backend registry / device interfaces (llama/9812)
4ac768e

Diego Devesa commited on

ggml : fix BLAS with unsupported types (llama/9775)
0a93e1b

Diego Devesa commited on

ggml : add backend registry / device interfaces to BLAS backend (llama/9752)
7f269bb

Diego Devesa commited on

ggml : add metal backend registry / device (llama/9713)
b6adf19

ggerganov slaren commited on

ggml-backend : add device and backend reg interfaces (llama/9707)
9d74d85

Diego Devesa commited on

ggml-backend : add device and backend reg interfaces (llama/9707)
1bdb50a

Diego Devesa JohannesGaessler commited on