PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16 • 106
FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs Paper • 2508.17906 • Published Aug 25 • 4
System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience Article • Jun 2 • 21
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation Paper • 2504.14538 • Published Apr 20 • 30
Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Paper • 2405.20216 • Published May 30, 2024 • 21
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Paper • 2503.20308 • Published Mar 26 • 23
Exploring the Evolution of Physics Cognition in Video Generation: A Survey Paper • 2503.21765 • Published Mar 27 • 11
Towards Scientific Discovery with Generative AI: Progress, Opportunities, and Challenges Paper • 2412.11427 • Published Dec 16, 2024 • 3