Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park
2024-03-28
Abstract: Retrieval-Augmented Large Language Models (LLMs), which incorporate the non-parametric knowledge from external knowledge bases into LLMs, have emerged as a promising approach to enhancing response accuracy in several tasks, such as Question-Answering (QA). However, even though there are various approaches dealing with queries of different complexities, they either handle simple queries with unnecessary computational overhead or fail to adequately address complex multi-step queries; yet, not all user requests fall into only one of the simple or complex categories. In this work, we propose a novel adaptive QA framework, that can dynamically select the most suitable strategy for (retrieval-augmented) LLMs from the simplest to the most sophisticated ones based on the query complexity. Also, this selection process is operationalized with a classifier, which is a smaller LM trained to predict the complexity level of incoming queries with automatically collected labels, obtained from actual predicted outcomes of models and inherent inductive biases in datasets. This approach offers a balanced strategy, seamlessly adapting between the iterative and single-step retrieval-augmented LLMs, as well as the no-retrieval methods, in response to a range of query complexities. We validate our model on a set of open-domain QA datasets, covering multiple query complexities, and show that ours enhances the overall efficiency and accuracy of QA systems, compared to relevant baselines including the adaptive retrieval approaches. Code is available at:
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the efficiency and accuracy of retrieval-augmented large language models (LLMs) when the queries they receive vary in complexity. Specifically:

1. **Analysis of Existing Problems**: Retrieval-augmented LLMs improve response accuracy by incorporating external knowledge bases, but existing approaches either spend unnecessary computation on simple queries or fail to adequately handle complex multi-step queries. Real user requests do not fall neatly into just one of these two categories.
2. **Research Objective**: The paper proposes Adaptive-RAG, an adaptive question-answering framework that dynamically selects the most suitable strategy for each query based on its complexity. The strategies range from the simplest (answering directly with the LLM, without retrieval) through single-step retrieval to the most sophisticated (iterative, multi-step retrieval).
3. **Main Contributions**:
   - Pointing out that existing retrieval-augmented generation methods do not adapt to query complexity, so they are either too costly for simple queries or too weak for complex ones;
   - Predicting query complexity with a classifier, a smaller LM trained on automatically collected labels (derived from the actual predicted outcomes of the models and from inductive biases inherent in the datasets), and selecting the retrieval strategy accordingly;
   - Showing, on a set of open-domain QA datasets covering multiple query complexities, that Adaptive-RAG improves the overall efficiency and accuracy of QA systems compared with relevant baselines, including adaptive retrieval approaches.

In summary, the paper introduces an adaptive mechanism that matches the retrieval strategy to the complexity of each query, yielding a more efficient and more accurate question-answering system.
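The routing idea described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the label names `"A"`, `"B"`, `"C"`, the three stub strategies, `toy_classifier` (a word-count heuristic standing in for the trained smaller LM), and the fallback rule in `collect_label` are all assumptions made for illustration, based only on the description in the abstract.

```python
from typing import Callable, Dict

# Hypothetical complexity labels (an assumption, not taken from the text above):
#   "A" = answer with the LLM alone, "B" = single-step retrieval,
#   "C" = iterative multi-step retrieval.
Strategy = Callable[[str], str]


def answer_without_retrieval(query: str) -> str:
    # Stub: a real system would call the LLM directly here.
    return f"[no-retrieval] {query}"


def answer_single_step(query: str) -> str:
    # Stub: a real system would retrieve once, then generate.
    return f"[single-step] {query}"


def answer_multi_step(query: str) -> str:
    # Stub: a real system would alternate retrieval and generation.
    return f"[multi-step] {query}"


STRATEGIES: Dict[str, Strategy] = {
    "A": answer_without_retrieval,
    "B": answer_single_step,
    "C": answer_multi_step,
}


def toy_classifier(query: str) -> str:
    """Placeholder for the trained complexity classifier.

    The paper uses a smaller LM; this word-count heuristic exists only
    so that the sketch runs end to end.
    """
    n_words = len(query.split())
    if n_words <= 4:
        return "A"
    if n_words <= 10:
        return "B"
    return "C"


def adaptive_answer(
    query: str,
    classify: Callable[[str], str] = toy_classifier,
    strategies: Dict[str, Strategy] = STRATEGIES,
) -> str:
    """Predict the complexity label, then dispatch to the matching strategy."""
    return strategies[classify(query)](query)


def collect_label(correct_by_strategy: Dict[str, bool], multi_hop_dataset: bool) -> str:
    """Assign a training label for one query (assumed labelling rule).

    Prefer the simplest strategy that answered correctly; if none did,
    fall back on the dataset's inductive bias (multi-hop -> "C", else "B").
    """
    for label in ("A", "B", "C"):
        if correct_by_strategy.get(label, False):
            return label
    return "C" if multi_hop_dataset else "B"
```

Keeping the classifier as a plain callable means the heuristic can be swapped for a trained model without touching the dispatch logic, and `collect_label` shows one plausible way that labels could be gathered automatically from model outcomes and dataset type.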