A Frustratingly Simple Decoding Method for Neural Text Generation Paper • 2305.12675 • Published May 22, 2023
GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation Paper • 2311.16511 • Published Nov 25, 2023 • 1
Data Augmentation for Text Generation Without Any Augmented Data Paper • 2105.13650 • Published May 28, 2021
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models Paper • 2401.08294 • Published Jan 16, 2024
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation Paper • 2206.02369 • Published Jun 6, 2022
ALR$^2$: A Retrieve-then-Reason Framework for Long-context Question Answering Paper • 2410.03227 • Published Oct 4, 2024
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 116
Momentum Decoding: Open-ended Text Generation As Graph Exploration Paper • 2212.02175 • Published Dec 5, 2022
Multi-task Learning for Low-resource Second Language Acquisition Modeling Paper • 1908.09283 • Published Aug 25, 2019
Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark Paper • 2411.15488 • Published Nov 23, 2024
Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines Paper • 2411.16365 • Published Nov 25, 2024 • 1
Training Language Models to Critique With Multi-agent Feedback Paper • 2410.15287 • Published Oct 20, 2024
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application Paper • 2510.19631 • Published Oct 22, 2025 • 27
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking Paper • 2510.20168 • Published Oct 23, 2025 • 27