whisper.cpp / ggml /src /ggml-vulkan

Commit History

vulkan : sync (llama/0)
4c17fa1

ggerganov commited on

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852

William Tambellini slaren commited on

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (llama/11595)
d7d82b9

Rémy O commited on

vulkan: matmul dequantization improvements (llama/12015)
ffdf466

Eve commited on

vulkan: improve im2col (llama/11826)
f6cff0a

Daniele commited on

vulkan: fix assertion when qy_needs_dequant (llama/12068)
271c7e4

jeffbolznv commited on

cuda/vulkan: specify fp32-only support for some operations in supports_op (ggml/1129)
f959b90

cmdr2 commited on

vulkan: implement several ops relevant for ggml_opt (llama/11769)
3c2171d

Rémy O commited on

vulkan: support multi/vision rope, and noncontiguous rope (llama/11902)
1c7a669

jeffbolznv commited on

vulkan: initial support for IQ1_S and IQ1_M quantizations (llama/11528)
0d2e888

Rémy O commited on

vulkan: linux builds + small subgroup size fixes (llama/11767)
e3f0e78

Eve commited on

vulkan: Make Vulkan optional at runtime (ggml/11493). (llama/11494)
762f497

Danny Milosavljevic jeffbolznv commited on

vulkan: add environment variable GGML_VK_PREFER_HOST_MEMORY to avoid VRAM allocation (llama/11592)
f9fd130

Wagner Bruna commited on

vulkan: account for lookup tables when checking shared memory size (llama/11502)
758970f

jeffbolznv commited on

vulkan: print shared memory size (llama/11719)
fb33a94

jeffbolznv commited on

vulkan: optimize coopmat2 iq2/iq3 callbacks (llama/11521)
3731f13

jeffbolznv commited on

vulkan: initial support for IQ4_XS quantization (llama/11501)
ed46ad5

Rémy O commited on

vulkan: use smaller combined allocations to avoid fragmentation (llama/11551)
1b7672d

jeffbolznv commited on

vulkan: implement initial support for IQ2 and IQ3 quantizations (llama/11360)
bd93c1b

Rémy Oudompheng jeffbolznv commited on

vulkan: Catch pipeline creation failure and print an error message (llama/11436)
d4f6b2c

jeffbolznv commited on

vulkan: compile shaders on-demand (llama/11406)
5c008f7

jeffbolznv commited on

Vulkan-run-test: fix mmq_wg_denoms (llama/11343)
133a580

amd-dwang commited on

vulkan: sort shaders for more deterministic binary (llama/11315)
d7c0046

jeffbolznv commited on

vulkan: fix diag_mask_inf (llama/11323)
f76204e

jeffbolznv commited on

vulkan: fix coopmat2 validation failures (llama/11284)
f2cc7e9

jeffbolznv commited on

vulkan: fix coopmat2 flash attention for non-contiguous inputs (llama/11281)
e0e73fa

jeffbolznv commited on

vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (llama/11166)
3bb9e77

jeffbolznv commited on

vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (llama/11206)
ee122d3

jeffbolznv commited on

vulkan: optimize coopmat2 q2_k dequant function (llama/11130)
d49a569

jeffbolznv commited on

vulkan: scale caching for k quants + misc fixes (llama/11081)
03ab36f

Eve commited on

fix: ggml: fix vulkan-shaders-gen build (llama/10448)
ad8f031

Sparkleholic commited on

Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (llama/11161)
5ad3f1d

OccamRazor commited on

Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (llama/11117)
623b74d

mbaudier commited on

fix: Vulkan shader gen binary path when Cross-compiling (llama/11096)
966a7bb

ag2s20150909 commited on

Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074)
4d90c3d

OccamRazor commited on

fix: Vulkan shader gen binary path (llama/11037)
7008fb8

Gilad S. commited on

vulkan: optimize mul_mat for small values of N (llama/10991)
5fc8eea

jeffbolznv commited on

vulkan: im2col and matmul optimizations for stable diffusion (llama/10942)
beef268

jeffbolznv commited on

vulkan: Use push constant offset to handle misaligned descriptors (llama/10987)
04e729a

jeffbolznv commited on

vulkan: multi-row k quants (llama/10846)
3bf5be1

Eve commited on

examples, ggml : fix GCC compiler warnings (llama/10983)
d7cf559

Peter commited on

vulkan: build fixes for 32b (llama/10927)
f1e76ce

jeffbolznv commited on

vulkan: optimize coopmat2 dequant functions (llama/10855)
5e70c43

jeffbolznv commited on

vulkan: bugfixes for small subgroup size systems + llvmpipe test (llama/10809)
9220b51

Eve commited on

rwkv6: add wkv6 support for Vulkan backend (llama/10829)
c7285d6

Zhiyuan Li mollysama commited on

llama : add Qwen2VL support + multimodal RoPE (llama/10361)
219d12b

RzZ ggerganov commited on

vulkan: small mul_mat_vec optimizations (llama/10665)
ec98109

Eve commited on

Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (llama/10798)
a812efc

OccamRazor commited on