Why Far Looks Up: Probing Spatial Representation in Vision-Language Models • Paper • 2605.30161 • Published 9 days ago • 59
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics • Paper • 2411.16537 • Published Nov 25, 2024 • 1
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning • Paper • 2605.28774 • Published 10 days ago • 87
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks • Paper • 2605.24218 • Published 15 days ago • 42
TactAlign: Human-to-Robot Policy Transfer via Tactile Alignment • Paper • 2602.13579 • Published Feb 14 • 11
Watch and Learn: Learning to Use Computers from Online Videos • Paper • 2510.04673 • Published Oct 6, 2025 • 12
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL • Paper • 2512.04069 • Published Dec 3, 2025 • 24
An Illusion of Progress? Assessing the Current State of Web Agents • Paper • 2504.01382 • Published Apr 2, 2025 • 4
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge • Paper • 2506.21506 • Published Jun 26, 2025 • 52