Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Soyeong Jeong,Jinheon Baek,Sukmin Cho,Sung Ju Hwang,Jong C. Park

2024-03-28

Abstract:Retrieval-Augmented Large Language Models (LLMs), which incorporate the non-parametric knowledge from external knowledge bases into LLMs, have emerged as a promising approach to enhancing response accuracy in several tasks, such as Question-Answering (QA). However, even though there are various approaches dealing with queries of different complexities, they either handle simple queries with unnecessary computational overhead or fail to adequately address complex multi-step queries; yet, not all user requests fall into only one of the simple or complex categories. In this work, we propose a novel adaptive QA framework, that can dynamically select the most suitable strategy for (retrieval-augmented) LLMs from the simplest to the most sophisticated ones based on the query complexity. Also, this selection process is operationalized with a classifier, which is a smaller LM trained to predict the complexity level of incoming queries with automatically collected labels, obtained from actual predicted outcomes of models and inherent inductive biases in datasets. This approach offers a balanced strategy, seamlessly adapting between the iterative and single-step retrieval-augmented LLMs, as well as the no-retrieval methods, in response to a range of query complexities. We validate our model on a set of open-domain QA datasets, covering multiple query complexities, and show that ours enhances the overall efficiency and accuracy of QA systems, compared to relevant baselines including the adaptive retrieval approaches. Code is available at:

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

The paper aims to address the efficiency and accuracy issues of Retrieval-Augmented Large Language Models (RAG) when handling queries of varying complexity. Specifically: 1. **Analysis of Existing Problems**: Although retrieval-augmented large language models improve response accuracy by incorporating external knowledge bases, current methods are overly complex for simple queries (leading to unnecessary computational overhead) and insufficient for complex multi-step queries. 2. **Research Objective**: A new adaptive question-answering framework, Adaptive-RAG, is proposed. It dynamically selects the most suitable strategy based on query complexity, ranging from the simplest single-step retrieval to complex multi-step reasoning, or even direct answers from the LLM without retrieval. 3. **Main Contributions**: - Highlighting that existing retrieval-augmented generation methods handle queries of different complexities in an overly uniform manner; - Evaluating query complexity through a small classifier and adjusting the operational strategy of the retrieval-augmented LLM accordingly; - Experimental results show that Adaptive-RAG significantly improves the overall accuracy and efficiency of the question-answering system across various benchmark datasets. In summary, this paper aims to optimize the performance of retrieval-augmented large language models in handling queries of varying complexity by introducing an adaptive mechanism, thereby achieving a more efficient and accurate question-answering system.

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models

ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering

Retrieval-Augmented Generation for Large Language Models: A Survey

MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

Meta Knowledge for Retrieval Augmented Large Language Models

WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment

DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering

REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering

Reliable, Adaptable, and Attributable Language Models with Retrieval

RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering