Andreas Stöffelbauer

andreasskyscanner

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 days ago

Context Training with Active Information Seeking

Paper • 2605.13050 • Published 9 days ago • 7

upvoted a paper 3 days ago

Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

Paper • 2605.13511 • Published 9 days ago • 32

upvoted a paper 30 days ago

Predicting integers from continuous parameters

Paper • 2602.10751 • Published Apr 13 • 3

upvoted 6 papers about 1 month ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 102

You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass

Paper • 2604.10966 • Published Apr 13 • 12

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

Paper • 2604.13010 • Published Apr 14 • 16

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 106

p1: Better Prompt Optimization with Fewer Prompts

Paper • 2604.08801 • Published Apr 9 • 9

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published Apr 2 • 42

upvoted 2 papers about 2 months ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 54

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

upvoted a paper 5 months ago

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 32

upvoted 2 papers 7 months ago

Hybrid Architectures for Language Models: Systematic Analysis and Design Insights

Paper • 2510.04800 • Published Oct 6, 2025 • 37

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

Paper • 2510.03264 • Published Sep 26, 2025 • 25