RF-DETR Realtime Webcam Demo 🎯 Running on Zero • Agents • Featured • 44 • Segment objects in live webcam and uploaded media
Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases Paper • 2606.05112 • Published 4 days ago • 3
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration? Paper • 2606.01247 • Published 7 days ago • 29
Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models Paper • 2605.28132 • Published 11 days ago • 25
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 6 days ago • 175
NV-Generate Synthetic Medical Imaging 🧠 Running on Zero • Agents • 14 • Synthetic 3D CT and MR generation with NVIDIA NV-Generate
LTX 2.3 Studio 🎬 Running on Zero • Agents • Featured • 215 • Generate videos from text, images, audio, or video clips
Omni-Video-Factory-API-iframe 🐠 Running • Agents • 100 • Access video creation tools via an embedded interface
Learning A Unified Risk Map for Autonomous Driving in Partially Observable Environments Paper • 2605.22189 • Published 17 days ago • 8
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction Paper • 2605.29341 • Published 10 days ago • 17
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 10 days ago • 60
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 10 days ago • 138
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 10 days ago • 76