support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (llama/6521) 873102e unverified Neo Zhang Jianyu committed on Apr 7, 2024
Fixed minor bug when enabling FP16 for non intel targets (llama/6464) f84edd5 unverified Ouadie EL FAROUKI AidanBeltonS committed on Apr 5, 2024
llama : add pipeline parallelism support (llama/6017) b5bb3f3 unverified slaren compilade ggerganov committed on Mar 13, 2024
ggml : reuse quantum structs across backends (llama/5943) bb0625f unverified ggerganov committed on Mar 12, 2024
sycl : update IQ1_S kernels (WIP - not working!) (llama/5995) 16dc72c unverified ggerganov committed on Mar 12, 2024
ggml : add ggml-common.h to deduplicate shared code (llama/5940) 0a37735 unverified ggerganov committed on Mar 9, 2024
Revert "[SYCL] fix error when set main gpu to non-zero (llama/5901)" (llama/5918) d7e8525 unverified Neo Zhang Jianyu committed on Mar 7, 2024
fix error when set main gpu to non-zero (llama/5901) 829c347 unverified Neo Zhang Jianyu committed on Mar 7, 2024
add wait() to make code stable (llama/5895) 41c3c12 unverified Neo Zhang Jianyu committed on Mar 6, 2024
fix mul_mat fault in CI/unit-test (llama/5862) 91bb65e unverified Neo Zhang Jianyu jinliangtao compilade Cebtenzzre Xuan Son Nguyen ggerganov Kawrakow ikawrakow Cebtenzzre Michael Podvitskiy phymbert github-actions[bot] Nindaleth Black_Fox iamlemec slaren dranger003 leejet Minsoo Cheong Dane Madsen hutli emozilla committed on Mar 5, 2024
ggml : introduce ggml_status (ggml/750) 151c676 unverified Michael Podvitskiy slaren ggerganov committed on Mar 4, 2024
Support multiple GPUs (split mode) on SYCL backend (llama/5806) b1865d2 unverified Neo Zhang Jianyu committed on Mar 2, 2024
Use batched mul_mat pathway (llama/5591) 4a30367 unverified AidanBeltonS Abhilash Majumder committed on Mar 1, 2024
Add support for soft_max ALiBi (llama/5639) 86d6a5e unverified AidanBeltonS Abhilash Majumder committed on Feb 26, 2024
Update ggml_sycl_op_mul_mat_vec_q (llama/5502) 963ffd5 unverified AidanBeltonS Abhilash Majumder committed on Feb 20, 2024
ggml-sycl: Replace 3d ops with macro (llama/5458) 12970f1 unverified Abhilash Majumder committed on Feb 12, 2024