When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation

Shiyu Ni, Keping Bi, Jiafeng Guo, Xueqi Cheng
2024-06-11
Abstract: Large Language Models (LLMs) have been found to have difficulty knowing they do not possess certain knowledge and tend to provide specious answers in such cases. Retrieval Augmentation (RA) has been extensively studied to mitigate LLMs' hallucinations. However, due to the extra overhead and unassured quality of retrieval, it may not be optimal to conduct RA all the time. A straightforward idea is to only conduct retrieval when LLMs are uncertain about a question. This motivates us to enhance the LLMs' ability to perceive their knowledge boundaries to help RA. In this paper, we first quantitatively measure LLMs' such ability and confirm their overconfidence. Then, we study how LLMs' certainty about a question correlates with their dependence on external retrieved information. We propose several methods to enhance LLMs' perception of knowledge boundaries and show that they are effective in reducing overconfidence. Additionally, equipped with these methods, LLMs can achieve comparable or even better performance of RA with much fewer retrieval calls.
Computation and Language
What problem does this paper attempt to address?
This paper addresses the overconfidence of large language models (LLMs) when they face questions outside their knowledge, and proposes to improve Retrieval Augmentation (RA) by enhancing the model's ability to perceive its own knowledge boundaries. Specifically, the main objectives of the paper are:

1. **Quantifying the LLMs' ability to perceive the boundaries of factual knowledge**: The study finds that LLMs are often overconfident when answering questions, even when they do not possess the relevant knowledge. The authors quantify this perception ability by defining several indicators (such as consistency, overconfidence, and conservatism), and find that overconfidence is the main cause of the unsatisfactory perception ability.
2. **Studying the relationship between the certainty of LLMs and their dependence on external information**: The authors explore whether LLMs rely more on externally provided information when they are uncertain about a question. By grouping questions by certainty level, they observe that the more uncertain an LLM is about a question, the more it tends to use the supporting retrieved documents.
3. **Proposing methods to reduce overconfidence**: The authors approach this from two directions: prompting LLMs to be more cautious when stating their certainty, and improving their ability to provide correct answers. They propose several methods, including Punish, Challenge, Think Step by Step, Explain, and Generate, and verify their effectiveness through experiments.
4. **Adaptive retrieval augmentation**: Building on these methods, the authors explore how to achieve adaptive retrieval augmentation by enhancing the LLMs' ability to perceive knowledge boundaries. Specifically, they show through experiments that retrieving only when the model expresses uncertainty can significantly reduce the number of unnecessary retrievals while maintaining or improving question-answering performance.

In summary, this paper aims to improve model performance on open-domain question-answering tasks by reducing the overconfidence of LLMs, enhancing their ability to perceive their own knowledge boundaries, and thereby using retrieval augmentation more effectively.
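The two core ideas in the summary, scoring how well stated confidence matches correctness and retrieving only when the model is unsure, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the summary does not give the paper's formulas, so the metric definitions below are one plausible reading of the indicator names (overconfidence as "confident but wrong", conservatism as "unconfident but right", consistency as the cases where confidence and correctness agree), and `Judged`, `boundary_metrics`, `adaptive_answer`, and the three callables are hypothetical names, not the authors' code.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple


@dataclass
class Judged:
    """One evaluated question (hypothetical record type)."""
    confident: bool  # the model claimed it knows the answer
    correct: bool    # the answer matched the reference


def boundary_metrics(records: Sequence[Judged]) -> Dict[str, float]:
    """Assumed definitions, expressed as fractions of all questions."""
    n = len(records)
    if n == 0:
        raise ValueError("need at least one record")
    # Confident but wrong.
    overconfidence = sum(r.confident and not r.correct for r in records) / n
    # Unconfident but right.
    conservatism = sum((not r.confident) and r.correct for r in records) / n
    # Confidence and correctness agree.
    consistency = sum(r.confident == r.correct for r in records) / n
    return {
        "consistency": consistency,
        "overconfidence": overconfidence,
        "conservatism": conservatism,
    }


def adaptive_answer(
    question: str,
    answer_fn: Callable[[str], Tuple[str, bool]],
    retrieve_fn: Callable[[str], List[str]],
    answer_with_docs_fn: Callable[[str, List[str]], str],
) -> Tuple[str, bool]:
    """Return (answer, retrieved), calling the retriever only when unsure.

    answer_fn is assumed to return the model's answer together with a
    boolean saying whether the model expressed certainty.
    """
    answer, confident = answer_fn(question)
    if confident:
        # The model claims to know the answer: skip the retrieval call.
        return answer, False
    docs = retrieve_fn(question)
    return answer_with_docs_fn(question, docs), True
```

In this sketch, a prompting strategy such as Punish or Challenge would live inside `answer_fn`, since it changes how often the model reports certainty. Lower overconfidence means fewer wrong answers slip past the gate without retrieval, while the fraction of questions returning `retrieved=True` gives the retrieval cost.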