JohannesGaessler's picture
CUDA: use mma PTX instructions for FlashAttention (llama/11583)
f328957