GGML and llama.cpp join HF to ensure the long-term progress of Local AI Article • Published 3 days ago • 292
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published 9 days ago • 49
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5-397B-A17B. • 2 items • Updated 6 days ago • 8
CASA Collection CASA: Cross-Attention as Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs • 6 items • Updated Dec 23, 2025 • 7
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 91
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Paper • 2509.24006 • Published Sep 28, 2025 • 118
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24, 2025 • 42
SageAttention2++: A More Efficient Implementation of SageAttention2 Paper • 2505.21136 • Published May 27, 2025 • 45
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing Paper • 2504.07964 • Published Apr 10, 2025 • 62
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published Apr 11, 2025 • 130