ruby : Add parallel transcription support (#3222) acad667 unverified KitaitiMakoto commited on Jun 4, 2025
ci : add mirror for ports.ubuntu.com (ARM packages) (#3221) 17ba7f5 unverified danbev commited on Jun 3, 2025
bindings.java : apply whisperParams in fullTranscribeWithTime instead of ignoring them (#3201) 18fb7d6 unverified Joas Dev commited on Jun 3, 2025
musa: correct MUSA SDK rc4.0.1 download URL (#3217) 90efe84 unverified R0CKSTAR commited on Jun 3, 2025
ci : use mirrors.kernel.org for Ubuntu packages (#3220) 62dd144 unverified danbev commited on Jun 2, 2025
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling (llama/12995) d5d55f2 Max Krasnyansky Diego Devesa commited on May 31, 2025
CUDA: add a prop in ggml_cuda_device_infor for distinguish iGPU or dGPU in cuda (#13856) (llama/13895) a75e157 Shawn yang Yzzzaz JohannesGaessler yangxiao Diego Devesa commited on May 31, 2025
CUDA: fix typo in FlashAttention code (llama/13926) 6fb9674 JohannesGaessler commited on May 30, 2025
sched : avoid changing cur_copy when a graph is already allocated (llama/13922) 1c0a5c0 Diego Devesa commited on May 30, 2025
cuda : prevent using split buffers with 3d/4d matrices (llama/13919) 6b6155b Diego Devesa commited on May 30, 2025
cmake: Guard GGML_CPU_ALL_VARIANTS by architecture (llama/13890) a434936 Christian Kastner commited on May 29, 2025
cmake: Factor out CPU architecture detection (llama/13883) b436dcc Christian Kastner commited on May 29, 2025
ggml: aarch64: Implement SVE F32 kernels for Mamba Sequential Scan Algorithm (llama/13882) bfc960a Vineel Abhinav ggerganov commited on May 29, 2025
ggml: aarch64: Implement SVE F32 kernels for vector functions (llama/13843) 7941e9b Vineel Abhinav commited on May 29, 2025
CUDA: fix FA tg at long context for CC >= 8.9 (llama/13852) d9bd7ce JohannesGaessler commited on May 28, 2025
CANN: Add SOC TYPE printing in cmake configuration (llama/13837) abeb563 leo-pony commited on May 28, 2025
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787) 1ab0f23 lhez commited on May 27, 2025
opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (llama/13790) 4473109 lhez commited on May 27, 2025
vulkan: use timestamp queries for GGML_VULKAN_PERF (llama/13817) 56ddc5b jeffbolznv commited on May 27, 2025
SYCL: add gelu_erf kernel (llama/13749) 49a9b40 Akarshan Biswas Atharva Dubey commited on May 27, 2025
ggml : remove ggml_graph_import and ggml_graph_export declarations (ggml/1247) 3c9a1d2 rgerganov commited on May 30, 2025
ruby : handle build options on installation (#3206) 7953154 unverified KitaitiMakoto commited on May 29, 2025
ggml : Fix backtrace breaking Windows build (#3203) 3f352bd unverified Daniel Tang commited on May 29, 2025
ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 1459465 Daniel Tang commited on May 28, 2025
whisper : remove whisper_load_backends function (#3196) 0cae2d6 unverified danbev commited on May 29, 2025
ruby : add VAD support, migration to Ruby's newer API (#3197) 1ee7297 unverified KitaitiMakoto commited on May 28, 2025
whisper : install shared libs when using GGML_BACKEND_DL (#3195) f44915b unverified peardox commited on May 28, 2025
tests : add a new benchmark test for long-form audio (#3185) 2d5018c unverified fujimotos commited on May 28, 2025
ggml-cpu: x86 feature detection is specific to x86 (llama/13811) d86ba47 Christian Kastner commited on May 27, 2025
ggml : allow CUDA graphs when using pipeline parallelism (llama/13814) b85e3c0 Diego Devesa commited on May 27, 2025
SYCL: Add non contiguous support in RMS_NORM and NORM kernels (llama/13611) 5de15cd Akarshan Biswas commited on May 26, 2025