whisper.cpp/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-q4_0.cu
CUDA: quantized KV support for FA vec (llama/7527)
JohannesGaessler, commit 315df8c
// This file has been autogenerated by generate-variants.py, do not edit manually.
#include "../fattn-vec-f32.cuh"
DECL_FATTN_VEC_F32_CASE(128, GGML_TYPE_F16, GGML_TYPE_Q4_0);
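The single macro line above appears to instantiate one variant of a templated kernel launcher (head size 128, F16 keys, Q4_0 values) in its own translation unit, presumably so that the many type combinations can be compiled in parallel. The following is a minimal host-side C++ sketch of that pattern, not the actual ggml code: `toy_type`, `toy_fattn_case`, `DECL_TOY_FATTN_CASE`, and `toy_self_check` are hypothetical names invented for illustration, and the real macro in `fattn-vec-f32.cuh` may differ in signature and details.

```cpp
#include <cassert>

// Hypothetical stand-ins for the tensor type enum; these are not ggml names.
enum toy_type { TOY_F16, TOY_Q4_0 };

// Generic launcher template, which would normally live in a shared header.
template <int D, toy_type type_K, toy_type type_V>
int toy_fattn_case() {
    // Encode the template parameters so a caller can see which variant ran.
    return D * 100 + static_cast<int>(type_K) * 10 + static_cast<int>(type_V);
}

// Expands to an explicit instantiation definition; the caller supplies the semicolon.
#define DECL_TOY_FATTN_CASE(D, type_K, type_V) \
    template int toy_fattn_case<D, type_K, type_V>()

// One line like this per generated file emits exactly one variant in that file.
DECL_TOY_FATTN_CASE(128, TOY_F16, TOY_Q4_0);

// Small self-check (assumption: called from a main() of your own).
inline void toy_self_check() {
    assert((toy_fattn_case<128, TOY_F16, TOY_Q4_0>() == 12801));
}
```

In a layout like this, a generator script only needs to write one short file per (head size, K type, V type) combination, while the shared header holds the template body.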