Abstract:As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate generating content that appears factual but is ungrounded. This issue of hallucination is arguably the biggest hindrance to safely deploying these powerful LLMs into real-world production systems that impact people's lives. The journey toward widespread adoption of LLMs in practical settings heavily relies on addressing and mitigating hallucinations. Unlike traditional AI systems focused on limited tasks, LLMs have been exposed to vast amounts of online text data during training. While this allows them to display impressive language fluency, it also means they are capable of extrapolating information from the biases in training data, misinterpreting ambiguous prompts, or modifying the information to align superficially with the input. This becomes hugely alarming when we rely on language generation capabilities for sensitive applications, such as summarizing medical records, financial analysis reports, etc. This paper presents a comprehensive survey of over 32 techniques developed to mitigate hallucination in LLMs. Notable among these are Retrieval Augmented Generation (Lewis et al, 2021), Knowledge Retrieval (Varshney et al,2023), CoNLI (Lei et al, 2023), and CoVe (Dhuliawala et al, 2023). Furthermore, we introduce a detailed taxonomy categorizing these methods based on various parameters, such as dataset utilization, common tasks, feedback mechanisms, and retriever types. This classification helps distinguish the diverse approaches specifically designed to tackle hallucination issues in LLMs. Additionally, we analyze the challenges and limitations inherent in these techniques, providing a solid foundation for future research in addressing hallucinations and related phenomena within the realm of LLMs.

What problem does this paper attempt to address?

The paper primarily focuses on addressing the issue of "hallucination" in large language models (LLMs) when generating text. Hallucination refers to the phenomenon where the model generates content that appears to be real but is actually baseless and incorrect information. This problem severely impacts the reliability and safety of LLMs in practical applications, especially in sensitive areas such as medical record summarization, customer service dialogues, financial analysis reports, and legal advice, where even small errors can lead to serious consequences. The paper systematically investigates and categorizes hallucination mitigation techniques through the following three aspects: 1. **Systematic Classification**: Proposes a comprehensive classification system that categorizes hallucination mitigation techniques into different types, including various models such as visual language models (VLMs). 2. **Feature Synthesis**: Summarizes the key features of these mitigation techniques, providing structured guidance for future research. 3. **Discussion of Challenges and Solutions**: Discusses the limitations and challenges faced by existing techniques and proposes potential solutions and future research directions. Specifically, the paper provides a detailed introduction to various hallucination mitigation techniques, such as Retrieval-Augmented Generation (RAG), feedback-based self-improvement methods, and Prompt Engineering. It analyzes the working principles and pros and cons of each method. Through these techniques, researchers aim to improve the factual accuracy of the content generated by LLMs, thereby enhancing their reliability and trustworthiness in practical applications.

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Hallucination Detection and Hallucination Mitigation: An Investigation

Hallucination of Multimodal Large Language Models: A Survey

Towards Mitigating Hallucination in Large Language Models via Self-Reflection

Cognitive Mirage: A Review of Hallucinations in Large Language Models

Unravelling the Mysteries of Hallucination in Large Language Models: Strategies for Precision in Artificial Intelligence Language Generation

Beyond Fine-Tuning: Effective Strategies for Mitigating Hallucinations in Large Language Models for Data Analytics

A Survey on Hallucination in Large Vision-Language Models

A Survey of Hallucination in Large Visual Language Models

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models

A Comprehensive Survey of Hallucination in Large Language, Image, Video and Audio Foundation Models

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models

Mitigating Hallucinations Using Ensemble of Knowledge Graph and Vector Store in Large Language Models to Enhance Mental Health Support

Insights into Classifying and Mitigating LLMs' Hallucinations