A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
AI & ML interests
None defined yet.
Recent Activity
Papers
View all Papers datasets 7
JavisVerse/AV-FineTune
Viewer
• Updated
• 1.43M • 78
JavisVerse/JavisUnd-Eval
Updated
• 111
JavisVerse/MM-PreTrain
Viewer
• Updated
• 340k • 142
JavisVerse/JavisInst-Omni
Viewer
• Updated
• 91.4k • 249 • 1
JavisVerse/JavisBench
Viewer
• Updated
• 22.3k • 64
JavisVerse/JavisData-audios
Viewer
• Updated
• 788k • 28
JavisVerse/TAVGBench_clean
Viewer
• Updated
• 1.58M • 10