Abstract: Large Language Models (LLMs) have been found to have difficulty recognizing when they do not possess certain knowledge, and they tend to provide specious answers in such cases. Retrieval Augmentation (RA) has been extensively studied as a way to mitigate LLMs' hallucinations. However, because retrieval adds overhead and its quality is not guaranteed, it may not be optimal to conduct RA all the time. A straightforward idea is to conduct retrieval only when LLMs are uncertain about a question. This motivates us to enhance LLMs' ability to perceive their knowledge boundaries in order to help RA. In this paper, we first quantitatively measure this ability and confirm that LLMs are overconfident. Then, we study how LLMs' certainty about a question correlates with their dependence on externally retrieved information. We propose several methods to enhance LLMs' perception of knowledge boundaries and show that they are effective in reducing overconfidence. Additionally, equipped with these methods, LLMs can achieve comparable or even better RA performance with far fewer retrieval calls.

What problem does this paper attempt to address?

This paper addresses the overconfidence of large language models (LLMs) when they face questions outside their knowledge, and proposes improving Retrieval Augmentation (RA) by strengthening the model's ability to perceive its own knowledge boundaries. Specifically, the paper has four main objectives:

1. **Quantifying LLMs' ability to perceive the boundaries of their factual knowledge**: The study finds that LLMs are often overconfident when answering questions, even when they lack the relevant knowledge. The authors quantify this perception ability with several indicators (such as consistency, overconfidence, and conservatism) and find that overconfidence is the main cause of the unsatisfactory perception ability.
2. **Studying the relationship between LLMs' certainty and their dependence on external information**: The authors explore whether LLMs rely more on externally provided information when they are uncertain about a question. By grouping questions by certainty level, they observe that the more uncertain an LLM is about a question, the more it tends to use the supporting retrieved documents.
3. **Proposing methods to reduce overconfidence**: The authors approach the problem from two directions: prompting LLMs to be more cautious when stating their certainty, and improving their ability to give correct answers. They propose several methods, including Punish, Challenge, Think Step by Step, Explain, and Generate, and verify their effectiveness through experiments.
4. **Adaptive retrieval augmentation**: Building on these methods, the authors explore how to achieve adaptive retrieval augmentation by enhancing LLMs' ability to perceive knowledge boundaries. Their experiments show that retrieving only when the model expresses uncertainty can significantly reduce the number of unnecessary retrievals while maintaining or improving question-answering performance.

In summary, this paper aims to improve performance on open-domain question answering by reducing the overconfidence of LLMs and enhancing their ability to perceive their own knowledge boundaries, so that retrieval augmentation is used more effectively.
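
To make the indicators and the retrieve-only-when-uncertain idea concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the summary names the indicators but does not define them, so overconfidence is assumed to be the share of questions where the model claims certainty but answers incorrectly, conservatism the share where it expresses uncertainty but answers correctly, and consistency the share where its stated certainty matches its correctness. The names `Judged`, `boundary_metrics`, `answer_adaptively`, `answer_fn`, and `retrieve_fn` are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence, Tuple


@dataclass
class Judged:
    """One evaluated question (hypothetical record type)."""
    confident: bool  # the model claimed it could answer correctly
    correct: bool    # the model's answer matched the gold answer


def boundary_metrics(records: Sequence[Judged]) -> Dict[str, float]:
    """Compute the three indicators under the ASSUMED definitions
    described in the lead-in (not confirmed by the summary)."""
    n = len(records)
    if n == 0:
        raise ValueError("need at least one judged question")
    overconfident = sum(r.confident and not r.correct for r in records)
    conservative = sum((not r.confident) and r.correct for r in records)
    consistent = sum(r.confident == r.correct for r in records)
    return {
        "overconfidence": overconfident / n,
        "conservatism": conservative / n,
        "consistency": consistent / n,
    }


# answer_fn(question, docs) returns (answer, confident); docs is None
# when the model answers from its own knowledge only.
AnswerFn = Callable[[str, Optional[List[str]]], Tuple[str, bool]]
RetrieveFn = Callable[[str], List[str]]


def answer_adaptively(
    question: str, answer_fn: AnswerFn, retrieve_fn: RetrieveFn
) -> Tuple[str, bool]:
    """Answer without retrieval first; retrieve only if the model
    expresses uncertainty. Returns (answer, retrieval_was_used)."""
    answer, confident = answer_fn(question, None)
    if confident:
        return answer, False  # no retrieval call spent
    docs = retrieve_fn(question)
    answer, _ = answer_fn(question, docs)
    return answer, True
```

In this sketch, `answer_fn` would wrap an LLM call whose prompt also elicits a certainty statement (for example, one of the prompting methods listed above), and `retrieve_fn` would wrap any retriever. Counting how often the second return value is `True` gives the number of retrieval calls, which is the cost that adaptive retrieval tries to reduce.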

When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation
