Abstract:The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), leading to remarkable advancements in text understanding and generation. Nevertheless, alongside these strides, LLMs exhibit a critical tendency to produce hallucinations, resulting in content that is inconsistent with real-world facts or user inputs. This phenomenon poses substantial challenges to their practical deployment and raises concerns over the reliability of LLMs in real-world scenarios, which attracts increasing attention to detect and mitigate these hallucinations. In this survey, we aim to provide a thorough and in-depth overview of recent advances in the field of LLM hallucinations. We begin with an innovative taxonomy of LLM hallucinations, then delve into the factors contributing to hallucinations. Subsequently, we present a comprehensive overview of hallucination detection methods and benchmarks. Additionally, representative approaches designed to mitigate hallucinations are introduced accordingly. Finally, we analyze the challenges that highlight the current limitations and formulate open questions, aiming to delineate pathways for future research on hallucinations in LLMs.

What problem does this paper attempt to address?

The paper aims to address the hallucination problem generated by large language models (LLMs) in natural language processing (NLP). Despite significant advancements in text understanding and generation, large language models tend to produce content that is inconsistent with real-world facts or user input, a phenomenon known as "hallucination." These hallucinations pose major challenges for practical deployment and raise concerns about the reliability of LLMs in real-world scenarios. The paper explores this issue through the following aspects: 1. **Redefining Hallucination Classification**: A more detailed hallucination classification framework is proposed, dividing hallucinations into two main categories: "factual hallucinations" and "faithfulness hallucinations." Factual hallucinations emphasize the discrepancy between generated content and verifiable real-world facts, while faithfulness hallucinations focus on the consistency between generated content and user instructions or context. 2. **Analyzing Causes of Hallucinations**: The paper delves into various causes of hallucinations, including data-level issues (such as erroneous data sources, knowledge boundary limitations), training phase issues (such as pre-training architecture flaws, insufficient target optimization), and inference phase issues (such as decoding strategy flaws). 3. **Detection Methods and Benchmarking**: It outlines effective methods for detecting LLM hallucinations and related benchmarks to assess the extent of hallucinations and the effectiveness of detection methods. 4. **Mitigation Strategies**: Various mitigation strategies designed to address different causes of hallucinations are detailed to improve the reliability and trustworthiness of LLMs. In summary, this review paper aims to provide guidance for future research by comprehensively analyzing the hallucination problem and its solutions, thereby promoting the development of more robust and trustworthy large language models.

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

A Survey on Hallucination in Large Vision-Language Models

A Survey of Hallucination in Large Visual Language Models

Hallucination of Multimodal Large Language Models: A Survey

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models

Cognitive Mirage: A Review of Hallucinations in Large Language Models

Hallucination Detection and Hallucination Mitigation: An Investigation

Unravelling the Mysteries of Hallucination in Large Language Models: Strategies for Precision in Artificial Intelligence Language Generation

Evaluation and Analysis of Hallucination in Large Vision-Language Models

Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

A Survey on Large Language Model Hallucination via a Creativity Perspective

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Survey of Hallucination in Natural Language Generation