Managing Procedural Memory in LLM Agents: Control, Adaptation, and Evaluation Paper • 2606.23127 • Published 11 days ago • 19
The Tatoxa System for Text Detoxification in Low-Resource Languages: The Case of Tatar Paper • 2606.26015 • Published 9 days ago • 9
OCC-RAG: Optimal Cognitive Core for Faithful Question Answering Paper • 2606.00683 • Published May 30 • 98
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Paper • 2010.11929 • Published Oct 22, 2020 • 21
Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models Paper • 2512.00590 • Published Nov 29, 2025 • 52
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare Paper • 2602.06717 • Published Feb 6 • 75
Can Your Uncertainty Scores Detect Hallucinated Entity? Paper • 2502.11948 • Published Feb 17, 2025 • 3
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 120
Multimodal Evaluation of Russian-language Architectures Paper • 2511.15552 • Published Nov 19, 2025 • 79
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story Paper • 2511.15210 • Published Nov 19, 2025 • 91
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models Paper • 2401.00396 • Published Dec 31, 2023 • 6
Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications Paper • 2509.17671 • Published Sep 22, 2025 • 12
LettuceDetect: A Hallucination Detection Framework for RAG Applications Paper • 2502.17125 • Published Feb 24, 2025 • 14
TinyLettuce Collection This collection contains our small models, based on the Ettin encoder (https://arxiv.org/abs/2507.11412) and trained on synthetic and RAGTruth data. • 6 items • Updated May 21 • 4
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper • 2510.04849 • Published Oct 6, 2025 • 117
Article LettuceDetect: A Hallucination Detection Framework for RAG Applications adaamko • Feb 28, 2025 • 13
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs Paper • 2509.08358 • Published Sep 10, 2025 • 13
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs Paper • 2508.11383 • Published Aug 15, 2025 • 40