Optimize mul_mat for Q4_0 on Intel GPU (llama/12035) 14fd317 Neo Zhang Jianyu arthw commited on Feb 24, 2025