Spaces:
Running
Running
Commit History
tests: add tests for GGUF (llama/10830)
e7722cb
ggml : add check for grad_accs (ggml/1046)
eacc95c
ggml : refactor online repacking (llama/10446)
163128e
ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034)
154bbc0
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)
bf73242
ggml : add support for dynamic loading of backends (llama/10469)
b73266f
cuda : optimize argmax (llama/10441)
69ae50d
ggml-opt: fix data corruption (ggml/1022)
a916e92
ggml : fix compile warnings (llama/0)
80d6ec0
ggml: new optimization interface (ggml/988)
dd33ace
ggml : fix some build issues
c5ba1d1
slaren
commited on
ggml : build backends as libraries (llama/10256)
3dc93f3
metal : optimize FA kernels (llama/10171)
44ff932
ggml : adjust is_first_call init value (llama/10193)
7e2b09b
ggml : fix arch check in bf16_to_fp32 (llama/10164)
09e4a9b
Diego Devesa
commited on
ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (llama/10167)
ba20d5c
Diego Devesa
commited on
ggml : move CPU backend to a separate file (llama/10144)
0f447f2
Diego Devesa
commited on
ggml : remove ggml_scratch (llama/10121)
3f0b7ba
llama : fix buffer checks for mamba and rwk (llama/10111)
9df9767
Diego Devesa
commited on
ggml : check tensor name lengths in gguf files (llama/10100)
0b78224
Diego Devesa
commited on
ggml : fix memory leaks when loading invalid gguf files (llama/10094)
f9baffc
Diego Devesa
commited on
llama : refactor model loader with backend registry (llama/10026)
582a21e
Diego Devesa
commited on
CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021)
bcbaad3
ggml : add asserts for type conversion in fattn kernels (llama/9971)
9542e42
add amx kernel for gemm (llama/8998)
db52137
fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)
cf75979
Gilad S
commited on
ggml : fix BLAS with unsupported types (llama/9775)
0a93e1b
Diego Devesa
commited on
ggml : alloc ggml_contexts on the heap (#2525)
3ccf40a
unverified
ggml-backend : add device and backend reg interfaces (llama/9707)
9d74d85
Diego Devesa
commited on
ggml-backend : add device and backend reg interfaces (llama/9707)
1bdb50a
ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)
52069b8
ggml: refactor cross entropy loss CPU impl. (ggml/976)
2a0805f
test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)
76aa810
ggml : fix ggml_cast (ggml/973)
c44d575
ggml: fix gradient allocation logic (ggml/966)
ad3f29d
ggml : define missing HWCAP flags (llama/9684)
1d52105
ggml : add run-time detection of neon, i8mm and sve (llama/9331)
12c0e23
Dan Johansson
commited on
threads: fix msvc build without openmp (llama/9615)
97b3eb5
Max Krasnyansky
commited on
threads: improve ggml_barrier scaling with large number of threads (llama/9598)
aca04d5
Max Krasnyansky
commited on
examples : adapt to ggml.h changes (ggml/0)
91c7734
ggml : refactoring (llama/#0)
1b62c96
ggml : fix n_threads_cur initialization with one thread (llama/9538)
af82b69
slaren
Max Krasnyansky
commited on
threadpool : skip polling for unused threads (llama/9461)
9d11a7a
Max Krasnyansky
commited on
ggml : ggml_type_name return "NONE" for invalid values (llama/9458)
8a1bb27
Yuri Khrustalev
commited on
riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442)
f77ad34
Ahmad Tameem
commited on