Abstract:Iterative retrieval refers to the process in which the model continuously queries the retriever during generation to enhance the relevance of the retrieved knowledge, thereby improving the performance of Retrieval-Augmented Generation (RAG). Existing work typically employs few-shot prompting or manually constructed rules to implement iterative retrieval. This introduces additional inference overhead and overlooks the remarkable reasoning capabilities of Large Language Models (LLMs). In this paper, we introduce Auto-RAG, an autonomous iterative retrieval model centered on the LLM's powerful decision-making capabilities. Auto-RAG engages in multi-turn dialogues with the retriever, systematically planning retrievals and refining queries to acquire valuable knowledge. This process continues until sufficient external information is gathered, at which point the results are presented to the user. To this end, we develop a method for autonomously synthesizing reasoning-based decision-making instructions in iterative retrieval and fine-tuned the latest open-source LLMs. The experimental results indicate that Auto-RAG is capable of autonomous iterative interaction with the retriever, effectively leveraging the remarkable reasoning and decision-making abilities of LLMs, which lead to outstanding performance across six benchmarks. Further analysis reveals that Auto-RAG can autonomously adjust the number of iterations based on the difficulty of the questions and the utility of the retrieved knowledge, without requiring any human intervention. Moreover, Auto-RAG expresses the iterative retrieval process in natural language, enhancing interpretability while providing users with a more intuitive experience\footnote{Code is available at \url{<a class="link-external link-https" href="https://github.com/ictnlp/Auto-RAG" rel="external noopener nofollow">this https URL</a>}.

Embedding-Informed Adaptive Retrieval-Augmented Generation of Large Language Models

Retrieval-Augmented Generation for Large Language Models: A Survey

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning

A Survey on Retrieval-Augmented Text Generation for Large Language Models

Retrieve Anything To Augment Large Language Models

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models

Generative Retrieval with Large Language Models

WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs

One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models

On the Role of Long-tail Knowledge in Retrieval Augmented Large Language Models

Layered Query Retrieval: an Adaptive Framework for Retrieval-Augmented Generation in Complex Question Answering for Large Language Models

Metacognitive Retrieval-Augmented Large Language Models

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Active Retrieval Augmented Generation

Generative Multi-Modal Knowledge Retrieval with Large Language Models

Benchmarking Large Language Models in Retrieval-Augmented Generation