Xiang Yue's picture

Xiang Yue

yuexiang96

·

https://xiangyue9607.github.io/

AI & ML interests

NLP/LLMs/LMMs

Recent Activity

authored a paper about 2 months ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

authored a paper about 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

authored a paper about 2 months ago

Simulating Environments with Reasoning Models for Agent Training

View all activity

Organizations

authored 4 papers about 2 months ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 29

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3, 2025 • 2

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 38

upvoted a paper about 2 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 38

commented a paper about 2 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 38 •

liked a dataset 3 months ago

neulab/agent-data-collection

Preview • Updated 23 days ago • 1.61k • 106

upvoted 2 papers 3 months ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24, 2025 • 22

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 29

updated a dataset 5 months ago

neulab/reasoning_traces

Preview • Updated Sep 4, 2025 • 58

published a dataset 5 months ago

neulab/reasoning_traces

Preview • Updated Sep 4, 2025 • 58

New activity in apurvaga/go-browse-wa-qwen-7B 6 months ago

upload tokenizer

#1 opened 6 months ago by

liked a dataset 7 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8, 2025 • 39.9k • 187 • 5

authored 7 papers 7 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17, 2025 • 39

Evaluating Vision-Language Models as Evaluators in Path Planning

Paper • 2411.18711 • Published Nov 27, 2024

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Paper • 2503.10582 • Published Mar 13, 2025 • 24

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Paper • 2503.19877 • Published Mar 25, 2025 • 1

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Paper • 2504.10342 • Published Apr 14, 2025 • 11

Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time

Paper • 2504.12329 • Published Apr 12, 2025

Overtrained Language Models Are Harder to Fine-Tune

Paper • 2503.19206 • Published Mar 24, 2025 • 2