Running on Zero Featured 105 SAM3 Video Segmentation 🐠 105 Track and label objects in videos using text prompts or clicks
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 234k • 1.57k
Running on Zero MCP Featured 211 ViTPose Transformers ⚡ 211 Detect and estimate human poses in images and videos
Running on Zero Featured 578 Chat with DeepSeek-VL2-small 🌍 578 Generate responses using images and text input