JohannesGaessler's picture
CUDA: refactor and optimize IQ MMVQ (llama/8215)
afa1447