hallucination · GitHub Topics

Libr-AI / OpenFactVerification

Loki: Open-source solution designed to automate the process of verifying factuality

人工智能 factuality hallucination

Python 1.06 k

6 个月前

jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness

#Awesome#Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

Awesome Lists calibration gpt-3 gpt-4 llms reliability robustness safety uncertainty-estimation uncertainty-quantification ChatGPT prompt-engineering prompting chain-of-thought in-context-learning large-language-models hallucination

740

1 个月前

VITA-MLLM / Woodpecker

#大语言模型#✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

hallucination hallucinations large-language-models 大语言模型 mllm multimodal-large-language-models multimodality

Python 635

4 个月前

amazon-science / RefChecker

RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

factuality hallucination llms

Python 358

5 个月前

tianyi-lab / HallusionBench

#大语言模型#[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

benchmark gpt-4 gpt-4v llava benchmarks hallucination 大语言模型 lmm large-language-models large-vision-language-models

Python 280

5 个月前

FuxiaoLiu / LRV-Instruction

#大语言模型#[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

evaluation gpt-4 hallucination object-detection vision vqa llama vicuna llava gpt multimodal prompt-engineering ChatGPT evaluation-metrics foundation-models vision-and-language iclr iclr2024

Python 274

1 年前

IAAR-Shanghai / ICSFSurvey

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

large-language-model large-language-models self-improvement chain-of-thought hallucination reasoning data-augmentation decoding knowledge-distillation

Jupyter Notebook 164

4 个月前

IAAR-Shanghai / UHGEval

#大语言模型#[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

benchmark dataset evaluation 大语言模型 ChatGPT gpt-3 gpt-4 hallucinations large-language-models qwen hallucination huggingface huggingface-transformers openai openai-api ceval

Python 161

5 个月前

xieyuquanxx / awesome-Large-MultiModal-Hallucination

😎 curated list of awesome LMM hallucinations papers, methods & resources.

hallucination multi-modal lmm multimodal

150

1 年前

ictnlp / TruthX

#大语言模型#Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

hallucinations language-model 大语言模型 llm-inference baichuan chatglm ChatGPT gpt-4 hallucination llama llama2 llms mistral safety representation explainable-ai llama3

Python 146

1 年前

zjunlp / KnowledgeCircuits

#自然语言处理#[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

人工智能 interpretability large-language-models 自然语言处理 circuit hallucination transformer

Python 138

2 个月前

shufangxun / LLaVA-MoD

#大语言模型#[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

mixture-of-experts multimodal-large-language-models knowledge-distillation hallucination rlhf llava qwen 大语言模型 mllm moe

Python 127

14 天前

NishilBalar / Awesome-LVLM-Hallucination

#大语言模型#up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

hallucination large-vision-language-models multimodal-large-language-models large-language-models 大语言模型 mllm

114

7 天前

AmourWaltz / Reliable-LLM

hallucination knowledge reliable uncertainty

JavaScript 112

7 个月前

zjunlp / FactCHD

#自然语言处理#[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

large-language-models hallucination knowledge 自然语言处理 benchmark dataset

Python 87

1 年前

yfzhang114 / LLaVA-Align

This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.

hallucination large-vision-language-models

Python 77

2 个月前

HillZhang1999 / ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

hallucination large-language-models

Python 63

1 年前

kereva-dev / kereva-scanner

#大语言模型#Code scanner to check for issues in prompts and LLM calls

code-scanning hallucination 大语言模型 llm-evaluation llm-security prompt-injection evaluation linter red-teaming 安全人工智能 ai-security 命令行界面

Python 56

7 天前

zjunlp / Deco

#自然语言处理#[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

人工智能 decoding hallucination large-language-models mllm multimodal-large-language-models 自然语言处理

Python 52

4 个月前

deshwalmahesh / PHUDGE

#自然语言处理#Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...

人工智能 evaluation finetuning 大语言模型机器学习自然语言处理 PyTorch sota hallucination llm-evaluation

Jupyter Notebook 49

9 个月前