Beyond Language Modeling: An Exploration of Multimodal Pretraining • Paper • 2603.03276 • Published 3 days ago • 68
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use • Paper • 2603.03205 • Published 3 days ago • 11
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios • Paper • 2602.23166 • Published 8 days ago • 28