Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

Li Du,Yequan Wang,Xingrun Xing,Yiqun Ya,Xiang Li,Xin Jiang,Xuezhi Fang

2023-09-11

Abstract:Although demonstrating superb performance on various NLP tasks, large language models (LLMs) still suffer from the hallucination problem, which threatens the reliability of LLMs. To measure the level of hallucination of LLMs, previous works first categorize the hallucination according to the phenomenon similarity, then quantify the proportion that model outputs contain hallucinatory contents. However, such hallucination rates could easily be distorted by confounders. Moreover, such hallucination rates could not reflect the reasons for the hallucination, as similar hallucinatory phenomena may originate from different sources. To address these issues, we propose to combine the hallucination level quantification and hallucination reason investigation through an association analysis, which builds the relationship between the hallucination rate of LLMs with a set of risk factors. In this way, we are able to observe the hallucination level under each value of each risk factor, examining the contribution and statistical significance of each risk factor, meanwhile excluding the confounding effect of other factors. Additionally, by recognizing the risk factors according to a taxonomy of model capability, we reveal a set of potential deficiencies in commonsense memorization, relational reasoning, and instruction following, which may further provide guidance for the pretraining and supervised fine-tuning process of LLMs to mitigate the hallucination.

Artificial Intelligence,Computation and Language

What problem does this paper attempt to address?

The paper attempts to address the issue of hallucination in large language models (LLMs) when generating content. Although LLMs perform excellently in various natural language processing tasks, the content they generate sometimes contains untrue, illogical, or fictitious information, which threatens the reliability of LLMs, especially in high-trust domains such as healthcare or finance. Specifically, the paper focuses on the following points: 1. **Quantification of Hallucination Levels**: Existing research typically quantifies hallucination levels by categorizing hallucination phenomena and calculating the proportion of hallucinated content in the model output. However, this quantification method is susceptible to confounding factors and fails to reflect the causes of hallucination. 2. **Exploration of Hallucination Causes**: Existing research mainly focuses on specific tasks and specific types of hallucinations, without comprehensively exploring the root causes of hallucination. This is crucial for guiding model training to reduce hallucinations. To address these issues, the paper proposes a method that combines the quantification of hallucination levels and the exploration of hallucination causes, establishing the relationship between hallucination rates and a set of potential risk factors through association analysis. This method can observe the impact of each risk factor on hallucination rates, exclude the interference of other factors, and reveal potential deficiencies in the model in areas such as commonsense memory, relational reasoning, and instruction following. These findings can provide guidance for further pre-training and supervised fine-tuning to mitigate the hallucination problem.

Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

On Large Language Models' Hallucination with Regard to Known Facts

Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models

The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

Hallucination of Multimodal Large Language Models: A Survey

Sources of Hallucination by Large Language Models on Inference Tasks

Cost-Effective Hallucination Detection for LLMs

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Banishing LLM Hallucinations Requires Rethinking Generalization

Zero-Resource Hallucination Prevention for Large Language Models

Unravelling the Mysteries of Hallucination in Large Language Models: Strategies for Precision in Artificial Intelligence Language Generation

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

Evaluation and Analysis of Hallucination in Large Vision-Language Models