Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization Paper • 2206.04007 • Published Jun 8, 2022
Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications Paper • 2407.19262 • Published Jul 27, 2024
Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models Paper • 2502.13313 • Published Feb 9
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection Paper • 2311.09834 • Published Nov 16, 2023
The Art of Embedding Fusion: Optimizing Hate Speech Detection Paper • 2306.14939 • Published Oct 8, 2023
Beyond Negativity: Re-Analysis and Follow-Up Experiments on Hope Speech Detection Paper • 2306.01742 • Published May 10, 2023
Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective Paper • 2604.23267 • Published 19 days ago
Fractional Rotation, Full Potential? Investigating Performance and Convergence of Partial RoPE Paper • 2603.11611 • Published Mar 12
TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability Paper • 2507.19419 • Published Sep 30, 2025
Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction Paper • 2404.12957 • Published Apr 19, 2024
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs Paper • 2412.11763 • Published Dec 16, 2024
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon Paper • 2406.17746 • Published Jun 25, 2024
Rote Learning Considered Useful: Generalizing over Memorized Data in LLMs Paper • 2507.21914 • Published Jul 29, 2025
Hubble: a Model Suite to Advance the Study of LLM Memorization Paper • 2510.19811 • Published Oct 22, 2025
In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations Paper • 2602.15456 • Published Feb 17
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper • 2304.01373 • Published Apr 3, 2023 • 9
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation Paper • 2505.24456 • Published May 30, 2025
A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge'ez Script Paper • 2507.15142 • Published Jul 20, 2025
Behind Maya: Building a Multilingual Vision Language Model Paper • 2505.08910 • Published May 13, 2025 • 2
Robust and Fine-Grained Detection of AI Generated Texts Paper • 2504.11952 • Published Apr 16, 2025 • 12