whisper.cpp / ggml-cuda / template-instances
19.2 kB
JohannesGaessler
CUDA: quantized KV support for FA vec (llama/7527)
315df8c