Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
-
Updated
May 11, 2026 - Python
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
Red Teaming python-framework for testing chatbots and GenAI systems.
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Attack to induce LLMs within hallucinations
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
An Easy-to-use Hallucination Detection Framework for LLMs.
[EMNLP 2023] Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
Code for Controlling Hallucinations at Word Level in Data-to-Text Generation (C. Rebuffel, M. Roberti, L. Soulier, G. Scoutheeten, R. Cancelliere, P. Gallinari)
Transactional Memory for AI Agents - Keep SQL and Vector DBs in sync with ACID-like guarantees
Code for PARENTing via Model-Agnostic Reinforcement Learning to Correct Pathological Behaviors in Data-to-Text Generation (Rebuffel, Soulier, Scoutheeten, Gallinari; INLG 2020)
A PyTorch implementation of the paper Thinking Hallucination for Video Captioning.
The full pipeline of creating UHGEval hallucination dataset
Add a description, image, and links to the hallucinations topic page so that developers can more easily learn about it.
To associate your repository with the hallucinations topic, visit your repo's landing page and select "manage topics."