VerifAI: an initiative to build an open-source, easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using an a posteriori model).
This repo hosts the Python SDK and related examples for AIMon, a proprietary, state-of-the-art system for detecting LLM quality issues such as hallucinations. It can be used during offline evals, continuous monitoring, or inline detection. We offer various model quality metrics that are fast, reliable, and cost-effective.
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
PAS2: A Python-based hallucination detection system that evaluates AI response consistency through paraphrasing and semantic similarity analysis. Features include response evaluation, similarity metrics, visualization tools, and a web interface for interactive testing.
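A minimal sketch of the paraphrase-consistency idea behind this kind of detector (illustrative only, not the PAS2 API; the embedding model, the `ask_llm` helper, and the threshold are assumptions):

```python
# Sketch of paraphrase-consistency checking (illustrative, not the PAS2 API).
# Assumes `ask_llm(question) -> str` is provided by your own LLM client.
from sentence_transformers import SentenceTransformer, util

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model

def consistency_score(question: str, paraphrases: list[str], ask_llm) -> float:
    """Query the LLM with the original question and its paraphrases,
    then return the mean pairwise cosine similarity of the answers."""
    answers = [ask_llm(q) for q in [question, *paraphrases]]
    embeddings = embedder.encode(answers, convert_to_tensor=True)
    sims = util.cos_sim(embeddings, embeddings)
    n = len(answers)
    # Average over distinct answer pairs (upper triangle of the similarity matrix).
    pair_sims = [sims[i][j].item() for i in range(n) for j in range(i + 1, n)]
    return sum(pair_sims) / len(pair_sims)

# Answers that drift apart under paraphrasing (low score) are flagged as
# potential hallucinations; the 0.8 cutoff below is an arbitrary assumption.
# is_suspect = consistency_score(question, paraphrases, ask_llm) < 0.8
```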
An up-to-date, curated list of state-of-the-art research, papers, and resources on hallucinations in large vision-language models.
Fact-checking with Iterative Retrieval and Verification
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2
Binary hallucination detection classifier using logistic regression
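A minimal sketch of what such a classifier can look like, assuming hand-crafted features such as source similarity and token overlap (the feature set and data below are illustrative, not taken from the repo):

```python
# Sketch of a binary hallucination classifier using logistic regression
# (illustrative; features and data are assumptions, not the repo's code).
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

# Each row is a feature vector for one (source, answer) pair, e.g.
# [embedding similarity to source, token-overlap ratio, answer length].
X = np.array([[0.92, 0.80, 45], [0.31, 0.10, 60], [0.88, 0.75, 30], [0.25, 0.05, 80]] * 25)
y = np.array([0, 1, 0, 1] * 25)  # 1 = hallucinated, 0 = grounded

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(classification_report(y_test, clf.predict(X_test)))
```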
Fully automated LLM evaluator
[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.
Hallucination in Chat-bots: Faithful Benchmark for Information-Seeking Dialogue
Chrome extension for the ATLAS project.
🔢 Hallucination detector for Large Language Models.
API for the ATLAS project.
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Different approaches to evaluating RAG.
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without a custom rubric or reference answer, in absolute or relative mode, and much more. It also contains a list of available tools, methods, repos, and code for hallucination detection, LLM evaluation, grading, and more.
Detecting Hallucinations in Large Language Model Generations using Graph Structures
Competition: SemEval-2024 Task-6 - SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes