AI & ML interests
Natural language processing, language models, language agents
Recent Activity
Papers
Automatic Image-Level Morphological Trait Annotation for Organismal Images
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
spaces 4
Online-Mind2Web Leaderboard 🌐
Running • Agents • 23
View agent performance leaderboards and visualizations
QUEST 🔎
Running • Agents
Generate comprehensive answers with web research
TravelPlannerLeaderboard 💻
Running • Agents • 21
Display and submit travel planner evaluation results
TravelPlannerEnvironment 👀
Sleeping • Agents • 4
Plan a travel itinerary with cost tracking
models 80
osunlp/QUEST-30B-MT-Plus-SFT
Text Generation • 31B • 65
osunlp/QUEST-30B-SFT
Text Generation • 31B • 64
osunlp/QUEST-35B-MT
Text Generation • 35B • 58
osunlp/QUEST-35B-RL
Text Generation • 35B • 49
osunlp/QUEST-35B-MT-Plus-SFT
Text Generation • 35B • 46
osunlp/QUEST-35B-SFT
Text Generation • 35B • 40
osunlp/QUEST-30B-RL
Text Generation • 31B • 45
osunlp/QUEST-9B
Text Generation • 9B • 49
osunlp/QUEST-4B
Text Generation • 5B • 81
osunlp/QUEST-2B
Text Generation • 2B • 51
datasets 33
osunlp/QUEST-RL-Data
Viewer • 1.13k • 27
osunlp/QUEST-SFT-Data-Open-ended
Viewer • 11.9k • 21
osunlp/QUEST-SFT-Data-Objective
Viewer • 39.9k • 23
osunlp/bioscan-traits
Viewer • 80.8k • 160 • 2
osunlp/D3-Gym-Trajectories
Viewer • 6.37k • 65
osunlp/D3-Gym
Viewer • 565 • 257
osunlp/autoresearch-sab-tasks
34
osunlp/ScienceAgentBench
Viewer • 102 • 1.61k • 19
osunlp/GUI-Drag-dataset
Preview • 117 • 4
osunlp/MisActBench
123 • 2