JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents Paper • 2208.13266 • Published Aug 28, 2022 • 1
Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond Paper • 2408.11338 • Published Aug 21, 2024
T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation Paper • 2306.00905 • Published Jun 1, 2023 • 1
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published 11 days ago • 51
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published 11 days ago • 51