UDKAG: Augmenting Large Vision-Language Models with Up-to-Date Knowledge Paper • 2405.14554 • Published May 23, 2024
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models Paper • 2504.05782 • Published Apr 8, 2025 • 3
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 6 days ago • 53
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 6 days ago • 53