LightHouse: A Survey of AGI Hallucination

Feng Wang
2024-01-17
Abstract: With the development of artificial intelligence, large-scale models have become increasingly intelligent. However, numerous studies indicate that hallucinations within these large models are a bottleneck hindering the development of AI research. In the pursuit of achieving strong artificial intelligence, a significant volume of research effort is being invested in the AGI (Artificial General Intelligence) hallucination research. Previous explorations have been conducted in researching hallucinations within LLMs (Large Language Models). As for multimodal AGI, research on hallucinations is still in an early stage. To further the progress of research in the domain of hallucinatory phenomena, we present a bird's eye view of hallucinations in AGI, summarizing the current work on AGI hallucinations and proposing some directions for future research.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper primarily explores the issue of hallucination in the field of Artificial General Intelligence (AGI) and attempts to address the following core questions:

1. **Definition and Classification**: The paper first clarifies the concept of AGI hallucination and categorizes it into three types:
   - Conflicts within the model's intrinsic knowledge (e.g., inconsistencies between the language model's output and input prompts)
   - Conflicts in information forgetting and updating (e.g., inability to retain previous knowledge or integrate new information)
   - Conflicts in multimodal fusion (errors arising when integrating information from different modalities)
2. **Cause Analysis**: The paper analyzes the causes of AGI hallucination, including factors such as the distribution of training data, the timeliness of information, and the ambiguity between different modalities.
3. **Mitigation Strategies**: To address the issue of AGI hallucination, the paper reviews existing mitigation methods, covering data preparation, model training and fine-tuning, Reinforcement Learning from Human Feedback (RLHF), and post-processing during the inference stage. Examples include reducing model hallucination by optimizing data quality, adopting appropriate training techniques, and utilizing external knowledge bases.
4. **Evaluation Methods**: Finally, the paper introduces several means of evaluating AGI hallucination, including rule-based methods, large-scale model-based methods, and human feedback-based methods, and discusses the development of related benchmark test sets.

Through this research, the paper aims to advance the field of AGI and provide guidance for future research directions.
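To make the idea of a rule-based evaluation method more concrete, the following is a minimal sketch and not a method taken from the paper. It assumes a hypothetical rule: any number or capitalized word that appears in a model's output but not in the source text is flagged as potentially unsupported. The function name `unsupported_tokens` and the regular expression are illustrative assumptions.

```python
import re

# Assumed rule (hypothetical, for illustration only): numbers and capitalized
# words are treated as "checkable" tokens that should be grounded in the source.
CHECKABLE = re.compile(r"\b(?:\d+(?:\.\d+)?|[A-Z][a-z]+)\b")


def unsupported_tokens(source: str, output: str) -> set:
    """Return checkable tokens found in `output` that never occur in `source`."""
    source_tokens = set(CHECKABLE.findall(source))
    output_tokens = set(CHECKABLE.findall(output))
    return output_tokens - source_tokens


if __name__ == "__main__":
    source = "Paris is the capital of France. It has about 2.1 million residents."
    output = "Paris, the capital of Germany, has 3.5 million residents."
    # Tokens in the output with no support in the source text.
    print(sorted(unsupported_tokens(source, output)))
```

A rule this simple would miss paraphrased or implied errors and would flag legitimate new wording, which is one reason such checks are usually combined with model-based and human feedback-based evaluation.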