DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning • Paper • 2506.17533 • Published Jun 21, 2025
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models • Paper • 2401.00396 • Published Dec 31, 2023
RAG-Reward: Optimizing RAG with Reward Modeling and RLHF • Paper • 2501.13264 • Published Jan 22, 2025