Alleviating Hallucinations in Large Language Models with Scepticism Modeling

Yetao Wu, Yihong Wang, Teng Chen, Chenxi Liu, Ningyuan Xi, Qingqing Gu, Hongyang Lei, Zhonglin Jiang, Yong Chen, Luo Ji
2024-09-10
Abstract: Hallucinations are a major challenge for large language models (LLMs) and hinder their adoption in diverse fields. Uncertainty estimation can be used to alleviate the damage caused by hallucinations. Human skepticism could be useful for enhancing a model's ability to estimate its own uncertainty. Inspired by this observation, we propose a new approach called Skepticism Modeling (SM). The approach is formalized by combining token and logit information for self-estimation. We construct doubt-emotion-aware data, perform continual pre-training, and then fine-tune the LLMs to improve their self-estimation ability. Experimental results demonstrate that this approach effectively enhances a model's ability to estimate its own uncertainty, and out-of-domain experiments validate its generalization to other tasks.
Computation and Language, Machine Learning
What problem does this paper attempt to address?