Commit History
CUDA: use mma PTX instructions for FlashAttention (llama/11583) f328957
rpc : early register backend devices (llama/11262) 4134077
CUDA: backwards pass for misc. ops, add tests (llama/11257) 2fbcec1
RoPE: fix back, CUDA support for back + noncont. (llama/11240) 131a21e
GGUF: C++ refactor, backend support, misc fixes (skip) (llama/11030) 92311a3
GGUF: C++ refactor, backend support, misc fixes (llama/11030) 21c5b64
tts : add OuteTTS support (llama/10784) 8d0f0ac
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) 83a0899
lhez, Skyler Szot, Shangqing Gu, Alexander Angus, Hongqiang Wang, Max Krasnyansky committed on
ggml: load all backends from a user-provided search path (llama/10699) c6de218
Gilad S, Diego Devesa committed on
ggml : refactor online repacking (llama/10446) 163128e
ggml : remove old files (skip) (#0) 6284570 unverified
ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034) 154bbc0
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541) bf73242
ggml : add support for dynamic loading of backends (llama/10469) b73266f
ggml: new optimization interface (ggml/988) dd33ace
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921) 3541ee8
Charles Xu, Diego Devesa committed on
ggml : build backends as libraries (llama/10256) 3dc93f3
metal : optimize FA kernels (llama/10171) 44ff932
ggml : move CPU backend to a separate file (llama/10144) 0f447f2
Diego Devesa committed on
llama : add simple-chat example (llama/10124) 41ff26f
Diego Devesa, Xuan Son Nguyen committed on
llama : use smart pointers for ggml resources (llama/10117) 6b82135
Diego Devesa committed on
ggml : remove ggml_scratch (llama/10121) 3f0b7ba
kompute: add backend registry / device interfaces (llama/10045) b612415
llama : refactor model loader with backend registry (llama/10026) 582a21e
Diego Devesa committed on
ggml : add AMX backend (llama/8998) 1152a79
Adapt to dynamically loadable backends mechanism (llama/9970) f8d4728
leo-pony committed on
Add SYCL Backend registry, device and Event Interfaces (llama/9705) f35cae5
Ouadie EL FAROUKI committed on
add amx kernel for gemm (llama/8998) db52137
vulkan : add backend registry / device interfaces (llama/9721) df2cb6e
Diego Devesa committed on
rpc : add backend registry / device interfaces (llama/9812) 4ac768e
Diego Devesa committed on
ggml : fix BLAS with unsupported types (llama/9775) 0a93e1b
Diego Devesa committed on
ggml : add backend registry / device interfaces to BLAS backend (llama/9752) 7f269bb
Diego Devesa committed on
ggml : add metal backend registry / device (llama/9713) b6adf19
ggml : alloc ggml_contexts on the heap (#2525) 3ccf40a unverified
ggml : fix typo in example usage ggml_gallocr_new (ggml/984) 30a097b
ggml-backend : add device and backend reg interfaces (llama/9707) 9d74d85
Diego Devesa committed on
ggml-backend : add device and backend reg interfaces (llama/9707) 1bdb50a
ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) 52069b8
ggml: refactor cross entropy loss CPU impl. (ggml/976) 2a0805f
metal : reduce command encoding overhead (llama/9698) 43d5a06
test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) 76aa810
ggml: fix gradient allocation logic (ggml/966) ad3f29d
ggml : add run-time detection of neon, i8mm and sve (llama/9331) 12c0e23
Dan Johansson committed on