CUDA: revise q8_1 data layout for mul_mat_q (llama/7824) fcfd59e JohannesGaessler committed on Jun 9, 2024
llama : add Command R Plus support (llama/6491) 8cf7097 Carolinabanana S S slaren ggerganov committed on Apr 9, 2024