AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Yujia Zhou,Zheng Liu,Zhicheng Dou
2024-11-11
Abstract:The emergence of Large Language Models (LLMs) has significantly advanced natural language processing, but these models often generate factually incorrect information, known as "hallucination". Initial retrieval-augmented generation (RAG) methods like the "Retrieve-Read" framework was inadequate for complex reasoning tasks. Subsequent prompt-based RAG strategies and Supervised Fine-Tuning (SFT) methods improved performance but required frequent retraining and risked altering foundational LLM capabilities. To cope with these challenges, we propose Assistant-based Retrieval-Augmented Generation (AssistRAG), integrating an intelligent information assistant within LLMs. This assistant manages memory and knowledge through tool usage, action execution, memory building, and plan specification. Using a two-phase training approach, Curriculum Assistant Learning and Reinforced Preference Optimization. AssistRAG enhances information retrieval and decision-making. Experiments show AssistRAG significantly outperforms benchmarks, especially benefiting less advanced LLMs, by providing superior reasoning capabilities and accurate responses.
Computation and Language,Artificial Intelligence,Information Retrieval
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that large - language models (LLMs) may generate factual errors (referred to as the "hallucination" phenomenon) when generating information, and the limitations of existing retrieval - augmented generation (RAG) methods in handling complex reasoning tasks. Specifically: 1. **Factual errors (hallucination)**: Although LLMs have made remarkable progress in natural language processing, they sometimes generate inaccurate information. This phenomenon limits the application of LLMs in fields that require a high degree of accuracy. 2. **Limitations of existing RAG methods**: - Initial RAG methods (such as the "retrieve - read" framework) are effective for basic question - answering tasks, but perform poorly when handling complex multi - step reasoning tasks. - Subsequent prompt - based RAG strategies and supervised fine - tuning (SFT) methods, although they improve performance, require frequent retraining and may change the core capabilities of the underlying LLMs. To address these challenges, the authors propose the **Assistant - based Retrieval - Augmented Generation (ASSIST RAG)** framework, which improves the performance of LLMs in complex reasoning tasks by integrating an intelligent information assistant to manage memory and knowledge. This framework enhances information retrieval and decision - making capabilities through core capabilities such as tool use, action execution, memory construction, and plan specification. Experimental results show that ASSIST RAG significantly outperforms existing methods in multiple benchmark tests, especially when assisting weaker LLMs.