Perplexity vs Model Performance Evaluation

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

aflah authored a paper 2 days ago

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization

aflah authored a paper 2 days ago

Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications

aflah authored a paper 2 days ago

Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models

View all activity

aflah

authored 9 papers 2 days ago

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization

Paper • 2206.04007 • Published Jun 8, 2022

Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications

Paper • 2407.19262 • Published Jul 27, 2024

Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models

Paper • 2502.13313 • Published Feb 9

Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection

Paper • 2311.09834 • Published Nov 16, 2023

The Art of Embedding Fusion: Optimizing Hate Speech Detection

Paper • 2306.14939 • Published Oct 8, 2023

Beyond Negativity: Re-Analysis and Follow-Up Experiments on Hope Speech Detection

Paper • 2306.01742 • Published May 10, 2023

Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective

Paper • 2604.23267 • Published 19 days ago

Fractional Rotation, Full Potential? Investigating Performance and Convergence of Partial RoPE

Paper • 2603.11611 • Published Mar 12

TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability

Paper • 2507.19419 • Published Sep 30, 2025

aflah

authored 7 papers 4 days ago

Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction

Paper • 2404.12957 • Published Apr 19, 2024

QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs

Paper • 2412.11763 • Published Dec 16, 2024

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Paper • 2406.17746 • Published Jun 25, 2024

Rote Learning Considered Useful: Generalizing over Memorized Data in LLMs

Paper • 2507.21914 • Published Jul 29, 2025

Hubble: a Model Suite to Advance the Study of LLM Memorization

Paper • 2510.19811 • Published Oct 22, 2025

In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations

Paper • 2602.15456 • Published Feb 17

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9

Henok

authored 2 papers 7 months ago

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

Paper • 2505.24456 • Published May 30, 2025

A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge'ez Script

Paper • 2507.15142 • Published Jul 20, 2025

DrishtiSharma

authored 2 papers about 1 year ago

Behind Maya: Building a Multilingual Vision Language Model

Paper • 2505.08910 • Published May 13, 2025 • 2

Robust and Fine-Grained Detection of AI Generated Texts

Paper • 2504.11952 • Published Apr 16, 2025 • 12

AI & ML interests

Recent Activity

Team members 7

perplexity-v-model-perf-eval's activity