A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

S.M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Anku Rani, Vipula Rawte, Aman Chadha, Amitava Das
2024-01-09
Abstract: As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate, generating content that appears factual but is ungrounded. This issue of hallucination is arguably the biggest hindrance to safely deploying these powerful LLMs into real-world production systems that impact people's lives. The journey toward widespread adoption of LLMs in practical settings heavily relies on addressing and mitigating hallucinations. Unlike traditional AI systems focused on limited tasks, LLMs have been exposed to vast amounts of online text data during training. While this allows them to display impressive language fluency, it also means they are capable of extrapolating information from the biases in training data, misinterpreting ambiguous prompts, or modifying the information to align superficially with the input. This becomes hugely alarming when we rely on language generation capabilities for sensitive applications, such as summarizing medical records or financial analysis reports. This paper presents a comprehensive survey of over 32 techniques developed to mitigate hallucination in LLMs. Notable among these are Retrieval Augmented Generation (Lewis et al., 2021), Knowledge Retrieval (Varshney et al., 2023), CoNLI (Lei et al., 2023), and CoVe (Dhuliawala et al., 2023). Furthermore, we introduce a detailed taxonomy categorizing these methods based on various parameters, such as dataset utilization, common tasks, feedback mechanisms, and retriever types. This classification helps distinguish the diverse approaches specifically designed to tackle hallucination issues in LLMs. Additionally, we analyze the challenges and limitations inherent in these techniques, providing a solid foundation for future research in addressing hallucinations and related phenomena within the realm of LLMs.
Computation and Language
What problem does this paper attempt to address?
The paper addresses the problem of "hallucination" in large language models (LLMs): the generation of content that appears factual but is baseless or incorrect. Hallucination undermines the reliability and safety of LLMs in practical applications, especially in sensitive areas such as medical record summarization, customer service dialogues, financial analysis reports, and legal advice, where even small errors can have serious consequences.

The paper surveys and categorizes hallucination mitigation techniques along three lines:

1. **Systematic Classification**: Proposes a comprehensive taxonomy that groups hallucination mitigation techniques into distinct types and also covers other model families such as visual language models (VLMs).
2. **Feature Synthesis**: Summarizes the key features of these mitigation techniques, providing structured guidance for future research.
3. **Discussion of Challenges and Solutions**: Discusses the limitations and challenges of existing techniques and proposes potential solutions and future research directions.

Specifically, the paper introduces a range of hallucination mitigation techniques, such as Retrieval-Augmented Generation (RAG), feedback-based self-improvement methods, and prompt engineering, and analyzes the working principles, strengths, and weaknesses of each. The shared goal of these techniques is to improve the factual accuracy of LLM-generated content, and thereby its reliability and trustworthiness in practical applications.