Apertus: Democratizing Open and Compliant LLMs for Global Language Environments Paper • 2509.14233 • Published Sep 17, 2025 • 20
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions Paper • 2507.08068 • Published Jul 10, 2025 • 1
Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis Paper • 2407.09835 • Published Jul 13, 2024 • 1
Tulu V1 Suite Collection The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources". • 34 items • Updated Mar 4, 2025 • 3