Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces Paper • 2510.06953 • Published Oct 8, 2025 • 8
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Paper • 2505.15277 • Published May 21, 2025 • 104