whisper.cpp / ggml-cuda /quantize.cuh

Commit History

CUDA: revise q8_1 data layout for mul_mat_q (llama/7824)
fcfd59e

JohannesGaessler commited on

llama : add Command R Plus support (llama/6491)
8cf7097
unverified

Carolinabanana S S slaren ggerganov commited on

sync : ggml (#2001)
cbbfa9e
unverified

ggerganov commited on