Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published Feb 27, 2025 • 32
Robotouille: An Asynchronous Planning Benchmark for LLM Agents Paper • 2502.05227 • Published Feb 6, 2025
MOSAIC: A Modular System for Assistive and Interactive Cooking Paper • 2402.18796 • Published Feb 29, 2024 • 25
Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought Paper • 2305.16744 • Published May 26, 2023 • 1