Abstract: Iterative retrieval refers to the process in which the model continuously queries the retriever during generation to enhance the relevance of the retrieved knowledge, thereby improving the performance of Retrieval-Augmented Generation (RAG). Existing work typically employs few-shot prompting or manually constructed rules to implement iterative retrieval. These approaches introduce additional inference overhead and overlook the remarkable reasoning capabilities of Large Language Models (LLMs). In this paper, we introduce Auto-RAG, an autonomous iterative retrieval model centered on the LLM's powerful decision-making capabilities. Auto-RAG engages in multi-turn dialogues with the retriever, systematically planning retrievals and refining queries to acquire valuable knowledge. This process continues until sufficient external information is gathered, at which point the results are presented to the user. To this end, we develop a method for autonomously synthesizing reasoning-based decision-making instructions in iterative retrieval and fine-tune the latest open-source LLMs. The experimental results indicate that Auto-RAG is capable of autonomous iterative interaction with the retriever, effectively leveraging the remarkable reasoning and decision-making abilities of LLMs, leading to outstanding performance across six benchmarks. Further analysis reveals that Auto-RAG can autonomously adjust the number of iterations based on the difficulty of the questions and the utility of the retrieved knowledge, without requiring any human intervention. Moreover, Auto-RAG expresses the iterative retrieval process in natural language, enhancing interpretability while providing users with a more intuitive experience. Code is available at https://github.com/ictnlp/Auto-RAG.
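
The retrieve, decide, and refine loop described in the abstract can be illustrated with a minimal sketch. This is not the Auto-RAG implementation: the corpus, the word-overlap `retrieve` function, the `toy_decide` stub, and the `max_iters` cap are all hypothetical stand-ins chosen so the example runs on its own. In the actual system the decision step would presumably be taken by the fine-tuned LLM, not by a hand-written rule.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical toy corpus; a real system would query an external retriever.
CORPUS = [
    "Hamlet was written by William Shakespeare",
    "William Shakespeare was born in Stratford-upon-Avon",
    "Mount Everest is in Nepal",
]


def retrieve(query: str, corpus: List[str], k: int = 1) -> List[str]:
    """Toy retriever: rank documents by word overlap with the query."""
    q = set(query.lower().split())
    ranked = sorted(
        corpus,
        key=lambda doc: len(q & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def toy_decide(question: str, evidence: List[str]) -> Tuple[bool, str]:
    """Stand-in for the LLM's decision step (an assumption, not the paper's model).

    Returns (True, answer) when the evidence suffices,
    otherwise (False, refined_query).
    """
    joined = " ".join(evidence)
    if "born in " in joined:
        return True, joined.split("born in ")[1]
    if "Shakespeare" in joined:
        return False, "William Shakespeare born"
    return False, question


def iterative_retrieve(
    question: str,
    decide: Callable[[str, List[str]], Tuple[bool, str]],
    corpus: List[str],
    max_iters: int = 5,
) -> Tuple[Optional[str], int]:
    """Query the retriever repeatedly until `decide` reports enough evidence."""
    evidence: List[str] = []
    query = question
    for step in range(1, max_iters + 1):
        for doc in retrieve(query, corpus):
            if doc not in evidence:
                evidence.append(doc)
        done, output = decide(question, evidence)
        if done:
            return output, step  # answer and number of iterations used
        query = output  # otherwise the output is the refined query
    return None, max_iters  # iteration budget exhausted without an answer
```

Calling `iterative_retrieve("Where was the author of Hamlet born?", toy_decide, CORPUS)` takes two iterations in this toy setup: the first retrieval surfaces the author, and the refined query then surfaces the birthplace. The number of iterations is determined by the decision step rather than fixed in advance, which is the behavior the abstract attributes to Auto-RAG.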

Towards Explainability in Retrieval-Augmented LLMs

RAGE Against the Machine: Retrieval-Augmented LLM Explanations

Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations

From Feature Importance to Natural Language Explanations Using LLMs with RAG

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation

Towards Uncovering How Large Language Model Works: An Explainability Perspective

From Understanding to Utilization: A Survey on Explainability for Large Language Models

XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely

Retrieval-Augmented Generation for Large Language Models: A Survey

Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

How Much Can RAG Help the Reasoning of LLM?

RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model

RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots

Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation

Knowledge Graphs as Context Sources for LLM-Based Explanations of Learning Recommendations

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

LMExplainer: Grounding Knowledge and Explaining Language Models