Scaling Law for Quantization-Aware Training
Paper
•
2505.14302
•
Published
•
76
Paper
•
2505.14674
•
Published
•
37
Paper
•
2505.09388
•
Published
•
320
AdaptThink: Reasoning Models Can Learn When to Think
Paper
•
2505.13417
•
Published
•
83
Thinkless: LLM Learns When to Think
Paper
•
2505.13379
•
Published
•
50
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper
•
2505.03335
•
Published
•
188
Seed1.5-VL Technical Report
Paper
•
2505.07062
•
Published
•
154
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable
Speaker Encoder
Paper
•
2505.07916
•
Published
•
134
Chain-of-Model Learning for Language Model
Paper
•
2505.11820
•
Published
•
121
Emerging Properties in Unified Multimodal Pretraining
Paper
•
2505.14683
•
Published
•
133
Parallel Scaling Law for Language Models
Paper
•
2505.10475
•
Published
•
83
Flow-GRPO: Training Flow Matching Models via Online RL
Paper
•
2505.05470
•
Published
•
86
RM-R1: Reward Modeling as Reasoning
Paper
•
2505.02387
•
Published
•
79
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Paper
•
2505.04588
•
Published
•
65
Scaling Reasoning, Losing Control: Evaluating Instruction Following in
Large Reasoning Models
Paper
•
2505.14810
•
Published
•
62
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous
Concept Space
Paper
•
2505.15778
•
Published
•
18
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop
System from Hypothesis to Verification
Paper
•
2505.16938
•
Published
•
120
Learning to Reason via Mixture-of-Thought for Logical Reasoning
Paper
•
2505.15817
•
Published
•
18
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Paper
•
2505.18129
•
Published
•
61
MemOS: A Memory OS for AI System
Paper
•
2507.03724
•
Published
•
157
4KAgent: Agentic Any Image to 4K Super-Resolution
Paper
•
2507.07105
•
Published
•
105
A Survey of Context Engineering for Large Language Models
Paper
•
2507.13334
•
Published
•
259