sycl: Fix and disable more configurations of mul_mat (llama/15151) 7b868ed Romain Biessy committed on Aug 12, 2025
cuda, sycl : fix batched gemm when ne02 == 1 && ne03 > 1 (llama/15038) cc3a2ed ggerganov committed on Aug 2, 2025
SYCL: Add set_rows support for quantized types (llama/14883) c55b72b Akarshan Biswas committed on Jul 28, 2025
sycl: refactor quantization to q8_1 (llama/14815) 31edd77 Alberto Cabrera Pérez committed on Jul 28, 2025
sycl: fixed semantics of block offset calculation (llama/14814) d3d52a4 Alberto Cabrera Pérez committed on Jul 24, 2025
sycl: fix undefined variable in work group size check (llama/14843) bcbbf47 Donghyeon Jeong committed on Jul 24, 2025
use max work group size for device to replace the magic number (llama/14732) e5e9b79 Neo Zhang Jianyu committed on Jul 18, 2025
sycl: Batched mulmat rework for oneDNN dispatch (llama/14617) 2722bea Anton Mitkov committed on Jul 14, 2025
ggml : add build-time message to remind about ggml_set_rows (llama/14661) 0f5d4ba ggerganov committed on Jul 13, 2025
SYCL: Initial set_rows kernel implementation (llama/14562) e62ef85 Akarshan Biswas committed on Jul 10, 2025
cuda : fix rope with partial rotation and non-cont src (llama/14580) aaf2d96 ggerganov committed on Jul 8, 2025
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (llama/14445) f798922 Sigbjørn Skjæret committed on Jul 3, 2025
Fix conditional enabling following arch checks for ggml-sycl (llama/14504) 1f15602 Nicolò Scipione committed on Jul 3, 2025
ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (llama/14435) ebacb3e ggerganov committed on Jul 12, 2025
ggml : implement REGLU/GEGLU/SWIGLU ops (llama/14158) add5c0f Sigbjørn Skjæret, ggerganov, OccamRazor, Akarshan, jeffbolznv committed on Jun 29, 2025
sycl: GGML_SYCL_DISABLE_OPT on by default for all Intel Devices (llama/13973) b25d3bf Anton Mitkov committed on Jun 25, 2025
sycl: add usage of enqueue_functions extension (llama/14244) 2e59a96 Nicolò Scipione committed on Jun 20, 2025
sycl: Cleanup codepaths in Get Rows in sycl backend (llama/14215) feee739 Anton Mitkov committed on Jun 19, 2025
sycl: Adding additional cpy dbg print output (llama/14034) 6799437 Anton Mitkov committed on Jun 13, 2025
sycl: Remove not needed copy f16->f32 for dnnl mul mat (llama/14125) eed049f Anton Mitkov committed on Jun 12, 2025
ggml-cpu : split arch-specific implementations (llama/13892) 8c833e9 xctan, ggerganov committed on Jun 9, 2025
sycl: Add reorder to Q6_K mmvq implementation (llama/13885) 56f0e48 Nicolò Scipione committed on Jun 9, 2025
SYCL: Implement few same quantized type copy kernels (llama/13739) 4c88a27 Akarshan Biswas committed on Jun 7, 2025
sycl: quantize and reorder the input to q8_1 when reorder is enabled (llama/13826) c4e62cd Atharva Dubey, Alberto Cabrera Pérez committed on Jun 2, 2025
SYCL: add gelu_erf kernel (llama/13749) 49a9b40 Akarshan Biswas, Atharva Dubey committed on May 27, 2025
SYCL: Add non contiguous support in RMS_NORM and NORM kernels (llama/13611) 5de15cd Akarshan Biswas committed on May 26, 2025
SYCL: revert "sycl: simplify bin_bcast_kernel (ggml/13383)" (llama/13752) 8c2a700 Akarshan Biswas committed on May 25, 2025
sycl : Remove waits from function calls (llama/13702) b9bf6b6 Nicolò Scipione committed on May 22, 2025
SYCL: Avoid using with SYCL-Graph for unsupported nodes (llama/13587) 7eb0e6e Ewan Crawford committed on May 22, 2025
sycl: disable reorder for sycl mulmat (llama/13536) e023dc2 Svetlozar Georgiev committed on May 20, 2025
sycl : Overcoming workaround for mmap() allocation on Windows (llama/13482) bf74ede Nicolò Scipione committed on May 20, 2025
sycl: use oneDNN for matrices multiplication (llama/12972) 2008e08 Łukasz Ślusarczyk committed on May 15, 2025
enable dpcpp nightly builds with libraries (llama/13406) c9c1196 Atharva Dubey committed on May 12, 2025