Commit History
sync : ggml 96e8b15
ggml-backend : fix async copy from CPU (llama/8897) 050174c
slaren committed on
Updated SYCL device filtering (llama/8901) 64976cd
Ouadie EL FAROUKI committed on
CUDA/HIP: fix tests/test-backend-ops (llama/8896) f14c1ad
CUDA: fix padding logic for FP16/FP32 (llama/8884) 643bcdb
ggml : add epsilon as a parameter for group_norm (llama/8818) d003891
ggml : fix overflows in elu function (llama/8866) a12468a
Justine Tunney committed on
ggml : reading the runtime sve config of the cpu (llama/8709) c26339f
jdomke domke committed on
Fix conversion of unnormalized BF16->BF16 weights (llama/7843) 8b10f59
Fixing wrong VDR iq4nl value (llama/8812) 30eb7bc
Ouadie EL FAROUKI committed on
ggml-cuda: Adding support for unified memory (llama/8035) 686bb18
Build: Only include execinfo.h on linux systems that support it (llama/8783) 0019ddb
Alex O'Connell committed on
cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (llama/8800) 73e80d1
slaren committed on
added android implementation of ggml_print_backtrace_symbols (llama/8751) 314d58a
cann: update cmake (llama/8765) 345a58d
Add `TIMESTEP_EMBEDDING` OP (llama/8707) 52eea23
ggml: bugfix: fix the inactive elements is agnostic for risc-v vector (llama/8748) 6989631
cuda : organize vendor-specific headers into vendors directory (llama/8746) ec2f307
R0CKSTAR committed on
add conv support (llama/8688) f0d6f5c
feat: Support Moore Threads GPU (llama/8383) a35db11
ggml : ignore more msvc warnings (ggml/906) 1b11fde
metal : fix struct name (ggml/912) 14cf8db
metal : add abort callback (ggml/905) b822172
vulkan : implement Stable Diffusion operators (ggml/904) 124c156
ggml : move c parameter comment to ggml_rope_ext (ggml/901) 6d34596
ggml : resolve sync conflicts (ggml/0) 82658f5
common : handle new quant types (ggml/0) 53bb541
ggml : add ggml-aarch64 (ggml/0) 0062819
Dibakar Gope committed on
ggml : reduce hash table reset cost (llama/8698) 9808fbf
slaren committed on
ggml: handle ggml_init failure to fix NULL pointer deref (llama/8692) dc51517
DavidKorczynski committed on
fix multi-gpu issue on sycl (llama/8554) 94a6436
ggml : add and use ggml_cpu_has_llamafile() (llama/8664) efcca56
Re-add erroneously removed -fsycl from GGML_EXTRA_LIBS (llama/8667) 6e12dfd
Joe Todd committed on
sycl : Add support for non-release DPC++ & oneMKL (llama/8644) 2a5814c
Joe Todd committed on
Vulkan IQ4_NL Support (llama/8613) 899145d
Allow all RDNA2 archs to use sdot4 intrinsic (llama/8629) 1d65fea
Jeroen Mostert committed on
fix scratch size of softmax (llama/8642) 6519fd2
ggml: fix compile error for RISC-V (llama/8623) 4eec44b
Mark Zhuang committed on
CUDA: MMQ code deduplication + iquant support (llama/8495) 6d14124
gguf : handle null name during init (llama/8587) 2f95156
ggml : fix quant dot product with odd number of blocks (llama/8549) 0083f96
ggml : add friendlier error message to fopen errors (llama/8575) ab5b4e0
CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572) afc137c
cmake : install all ggml public headers (llama/8480) 73a16f3
65a 65a committed on