FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
• 2402.10986
• Published
• 81
Aria Everyday Activities Dataset
Paper
• 2402.13349
• Published
• 31
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Paper
• 2403.04132
• Published
• 40
SaulLM-7B: A pioneering Large Language Model for Law
Paper
• 2403.03883
• Published
• 90
VideoAgent: Long-form Video Understanding with Large Language Model as
Agent
Paper
• 2403.10517
• Published
• 37
RAFT: Adapting Language Model to Domain Specific RAG
Paper
• 2403.10131
• Published
• 72
Med42-v2: A Suite of Clinical LLMs
Paper
• 2408.06142
• Published
• 52
The AI Scientist: Towards Fully Automated Open-Ended Scientific
Discovery
Paper
• 2408.06292
• Published
• 128
Sapiens: Foundation for Human Vision Models
Paper
• 2408.12569
• Published
• 94
Law of Vision Representation in MLLMs
Paper
• 2408.16357
• Published
• 95
CogVLM2: Visual Language Models for Image and Video Understanding
Paper
• 2408.16500
• Published
• 57
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper
• 2408.15545
• Published
• 38
From MOOC to MAIC: Reshaping Online Teaching and Learning through
LLM-driven Agents
Paper
• 2409.03512
• Published
• 29
WildVision: Evaluating Vision-Language Models in the Wild with Human
Preferences
Paper
• 2406.11069
• Published
• 14
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at
Any Resolution
Paper
• 2409.12191
• Published
• 78
LLMs + Persona-Plug = Personalized LLMs
Paper
• 2409.11901
• Published
• 35
Vista3D: Unravel the 3D Darkside of a Single Image
Paper
• 2409.12193
• Published
• 10
bartowski/Sky-T1-32B-Preview-GGUF
Text Generation
• 33B • Updated
• 574
• 82
Paper
• 2502.06049
• Published
• 31
Text Generation
• Updated
• 1.59k
• • 532
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning
for LLMs
Paper
• 2510.11696
• Published
• 181
Paper
• 2510.18212
• Published
• 36
MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment
Paper
• 2512.09636
• Published
• 26
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics
Paper
• 2512.13660
• Published
• 37
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Paper
• 2601.06002
• Published
• 55
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
Paper
• 2602.07845
• Published
• 69
Text Generation
• 754B • Updated
• 180k
• • 1.51k
GLM-5: from Vibe Coding to Agentic Engineering
Paper
• 2602.15763
• Published
• 87
MiMo-V2-Flash Technical Report
Paper
• 2601.02780
• Published
• 35