Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Paper • 2403.03950 • Published Mar 6, 2024 • 15
Mixtures of Experts Unlock Parameter Scaling for Deep RL Paper • 2402.08609 • Published Feb 13, 2024 • 36
Bigger, Better, Faster: Human-level Atari with human-level efficiency Paper • 2305.19452 • Published May 30, 2023 • 4