Ritvik Rastogi's picture

12 2 2

Ritvik Rastogi

Ritvik19

·

https://ritvik19.github.io

AI & ML interests

Machine Learning Deep Learning, Natural Language Processing, Computer Vision

Organizations

commented a paper 3 months ago

jina-reranker-v3: Last but Not Late Interaction for Document Reranking

Paper • 2509.25085 • Published Sep 29, 2025 • 7 •

commented 2 papers 6 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 31 •

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 31 •

commented 10 papers 8 months ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published May 10, 2025 • 30 •

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published May 10, 2025 • 30 •

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98 •

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98 •

Process Reward Models That Think

Paper • 2504.16828 • Published Apr 23, 2025 • 18 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

commented 5 papers 9 months ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15, 2025 • 12 •

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15, 2025 • 12 •

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models

Paper • 2504.06214 • Published Apr 8, 2025 •

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Paper • 2503.20641 • Published Mar 26, 2025 • 10 •

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Paper • 2503.20641 • Published Mar 26, 2025 • 10 •

New activity in open-acc/README about 1 year ago

[24/ 11] What are you working on this week! 💪

#2 opened about 1 year ago by

New activity in Ritvik19/openhermes-danube2-sft-qlora over 1 year ago

Adding Evaluation Results

#1 opened over 1 year ago by

leaderboard-pr-bot