CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection
Paper • 2605.16839 • Published • 12
Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling?
Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection