Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

Li Du, Yequan Wang, Xingrun Xing, Yiqun Ya, Xiang Li, Xin Jiang, Xuezhi Fang
2023-09-11
Abstract: Although demonstrating superb performance on various NLP tasks, large language models (LLMs) still suffer from the hallucination problem, which threatens the reliability of LLMs. To measure the level of hallucination of LLMs, previous works first categorize the hallucination according to phenomenon similarity, then quantify the proportion of model outputs that contain hallucinatory content. However, such hallucination rates can easily be distorted by confounders. Moreover, such hallucination rates cannot reflect the reasons for the hallucination, as similar hallucinatory phenomena may originate from different sources. To address these issues, we propose to combine hallucination level quantification and hallucination reason investigation through an association analysis, which builds the relationship between the hallucination rate of LLMs and a set of risk factors. In this way, we are able to observe the hallucination level under each value of each risk factor, examining the contribution and statistical significance of each risk factor while excluding the confounding effect of other factors. Additionally, by recognizing the risk factors according to a taxonomy of model capability, we reveal a set of potential deficiencies in commonsense memorization, relational reasoning, and instruction following, which may further provide guidance for the pretraining and supervised fine-tuning process of LLMs to mitigate the hallucination.
Artificial Intelligence, Computation and Language
What problem does this paper attempt to address?
The paper addresses hallucination in the content generated by large language models (LLMs). Although LLMs perform well on a wide range of natural language processing tasks, their output sometimes contains untrue, illogical, or fictitious information. This threatens the reliability of LLMs, especially in high-trust domains such as healthcare or finance. Specifically, the paper focuses on the following points:

1. **Quantification of Hallucination Levels**: Existing research typically quantifies hallucination levels by categorizing hallucination phenomena and calculating the proportion of model outputs that contain hallucinated content. However, such rates are susceptible to confounding factors and do not reflect the causes of hallucination.
2. **Exploration of Hallucination Causes**: Existing research mainly focuses on specific tasks and specific types of hallucination, without comprehensively exploring the root causes. Understanding those causes is crucial for guiding model training to reduce hallucinations.

To address these issues, the paper proposes a method that combines the quantification of hallucination levels with the exploration of hallucination causes, using association analysis to relate hallucination rates to a set of potential risk factors. This makes it possible to observe the effect of each risk factor on the hallucination rate while excluding the interference of other factors, and it reveals potential deficiencies in the model in commonsense memorization, relational reasoning, and instruction following. These findings can provide guidance for further pretraining and supervised fine-tuning to mitigate the hallucination problem.
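To make the confounding concern concrete, the following is a minimal sketch, not the paper's procedure or data. It assumes a single binary risk factor, one categorical confounder, and a binary hallucination label, and it uses a Mantel-Haenszel pooled odds ratio as a stand-in for an association analysis that adjusts for a confounder. The stratum names (`"easy"`, `"hard"`), the counts, and all function names are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def make_records(stratum, exposed, n_total, n_hallucinated):
    """Build toy records: (risk factor present, confounder stratum, hallucinated)."""
    return [(exposed, stratum, i < n_hallucinated) for i in range(n_total)]


def crude_odds_ratio(records):
    """Odds ratio of hallucination for exposed vs. unexposed, ignoring strata."""
    a = b = c = d = 0
    for exposed, _stratum, hallucinated in records:
        if exposed and hallucinated:
            a += 1  # risk factor present, hallucinated
        elif exposed:
            b += 1  # risk factor present, no hallucination
        elif hallucinated:
            c += 1  # risk factor absent, hallucinated
        else:
            d += 1  # risk factor absent, no hallucination
    return (a * d) / (b * c)


def mantel_haenszel_odds_ratio(records):
    """Pooled odds ratio across confounder strata (Mantel-Haenszel estimator)."""
    tables = defaultdict(lambda: [0, 0, 0, 0])  # per stratum: [a, b, c, d]
    for exposed, stratum, hallucinated in records:
        index = (0 if exposed else 2) + (0 if hallucinated else 1)
        tables[stratum][index] += 1
    numerator = denominator = 0.0
    for a, b, c, d in tables.values():
        n = a + b + c + d
        numerator += a * d / n
        denominator += b * c / n
    return numerator / denominator


# Hypothetical toy data: the risk factor appears mostly in "hard" prompts,
# and hallucination depends only on difficulty (10% easy, 50% hard).
records = (
    make_records("easy", True, 20, 2)
    + make_records("easy", False, 80, 8)
    + make_records("hard", True, 80, 40)
    + make_records("hard", False, 20, 10)
)

print("crude odds ratio:", crude_odds_ratio(records))
print("adjusted odds ratio:", mantel_haenszel_odds_ratio(records))
```

In this toy data the raw hallucination rate looks much higher when the risk factor is present, yet within each difficulty stratum the rates are identical, so the stratified estimate shows no association. With several risk factors, a regression-style analysis would usually be more practical than stratification; the sketch only illustrates why unadjusted rates can mislead.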