OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows • Paper • 2510.24411 • Published Oct 28, 2025 • 71
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence • Paper • 2510.23538 • Published Oct 27, 2025 • 96
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters • Paper • 2507.13618 • Published Jul 18, 2025 • 16
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization • Paper • 2508.14460 • Published Aug 20, 2025 • 85
Seed-X • Collection • A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22, 2025 • 65
How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective • Paper • 2505.21505 • Published May 27, 2025 • 18
R-PRM • Collection • R-PRM: Reasoning-Driven Process Reward Modeling • 3 items • Updated Mar 31, 2025 • 3
🪐 SmolLM • Collection • A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 241
MAPO: Multilingual Reasoning with Preference Optimization • Collection • MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization • 10 items • Updated Mar 26, 2024 • 3