MiroThinker-v1.5 Collection MiroMind’s Open Source Research Agent for Prediction • 4 items • Updated about 22 hours ago • 23
DSR_Suite Collection Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models • 3 items • Updated 24 days ago • 6
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published 24 days ago • 49
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 183
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning Paper • 2510.11027 • Published Oct 13, 2025 • 21
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints Paper • 2510.08565 • Published Oct 9, 2025 • 19
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published Dec 12, 2024 • 38