-
Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum
Paper • 2510.00526 • Published • 11 -
gaotang/figlet_font
Viewer • Updated • 45k • 9 -
gaotang/medical_sft_processed
Viewer • Updated • 23.5k • 14 -
gaotang/numina-cot-subset-67k
Viewer • Updated • 67.6k • 20
Gaotang Li
gaotang
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction upvoted a paper about 5 hours ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards submitted a paper about 5 hours ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable RewardsOrganizations
None yet