AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Yujia Zhou,Zheng Liu,Zhicheng Dou

2024-11-11

Abstract:The emergence of Large Language Models (LLMs) has significantly advanced natural language processing, but these models often generate factually incorrect information, known as "hallucination". Initial retrieval-augmented generation (RAG) methods like the "Retrieve-Read" framework was inadequate for complex reasoning tasks. Subsequent prompt-based RAG strategies and Supervised Fine-Tuning (SFT) methods improved performance but required frequent retraining and risked altering foundational LLM capabilities. To cope with these challenges, we propose Assistant-based Retrieval-Augmented Generation (AssistRAG), integrating an intelligent information assistant within LLMs. This assistant manages memory and knowledge through tool usage, action execution, memory building, and plan specification. Using a two-phase training approach, Curriculum Assistant Learning and Reinforced Preference Optimization. AssistRAG enhances information retrieval and decision-making. Experiments show AssistRAG significantly outperforms benchmarks, especially benefiting less advanced LLMs, by providing superior reasoning capabilities and accurate responses.

Computation and Language,Artificial Intelligence,Information Retrieval

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that large - language models (LLMs) may generate factual errors (referred to as the "hallucination" phenomenon) when generating information, and the limitations of existing retrieval - augmented generation (RAG) methods in handling complex reasoning tasks. Specifically: 1. **Factual errors (hallucination)**: Although LLMs have made remarkable progress in natural language processing, they sometimes generate inaccurate information. This phenomenon limits the application of LLMs in fields that require a high degree of accuracy. 2. **Limitations of existing RAG methods**: - Initial RAG methods (such as the "retrieve - read" framework) are effective for basic question - answering tasks, but perform poorly when handling complex multi - step reasoning tasks. - Subsequent prompt - based RAG strategies and supervised fine - tuning (SFT) methods, although they improve performance, require frequent retraining and may change the core capabilities of the underlying LLMs. To address these challenges, the authors propose the **Assistant - based Retrieval - Augmented Generation (ASSIST RAG)** framework, which improves the performance of LLMs in complex reasoning tasks by integrating an intelligent information assistant to manage memory and knowledge. This framework enhances information retrieval and decision - making capabilities through core capabilities such as tool use, action execution, memory construction, and plan specification. Experimental results show that ASSIST RAG significantly outperforms existing methods in multiple benchmark tests, especially when assisting weaker LLMs.

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Retrieval-Augmented Generation for Large Language Models: A Survey

ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models

Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation

WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs

Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation

IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

Corrective Retrieval Augmented Generation

SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains

RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases

RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

Metacognitive Retrieval-Augmented Large Language Models

BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine

LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation

Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG