Visual Funnel: Resolving Contextual Blindness in Multimodal Large Language Models Paper • 2512.10362 • Published 20 days ago • 1
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 22 days ago • 115
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published Jul 23 • 50