StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Zhuoqun Li,Xuanang Chen,Haiyang Yu,Hongyu Lin,Yaojie Lu,Qiaoyu Tang,Fei Huang,Xianpei Han,Le Sun,Yongbin Li
2024-10-25
Abstract:Retrieval-augmented generation (RAG) is a key means to effectively enhance large language models (LLMs) in many knowledge-based tasks. However, existing RAG methods struggle with knowledge-intensive reasoning tasks, because useful information required to these tasks are badly scattered. This characteristic makes it difficult for existing RAG methods to accurately identify key information and perform global reasoning with such noisy augmentation. In this paper, motivated by the cognitive theories that humans convert raw information into various structured knowledge when tackling knowledge-intensive reasoning, we proposes a new framework, StructRAG, which can identify the optimal structure type for the task at hand, reconstruct original documents into this structured format, and infer answers based on the resulting structure. Extensive experiments across various knowledge-intensive tasks show that StructRAG achieves state-of-the-art performance, particularly excelling in challenging scenarios, demonstrating its potential as an effective solution for enhancing LLMs in complex real-world applications.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenges faced by existing Retrieval - Augmented Generation (RAG) methods when dealing with knowledge - intensive reasoning tasks. Specifically, existing RAG methods are difficult to effectively handle these tasks because the required relevant information is scattered in a large number of documents, making it difficult for the model to accurately identify key information and perform global reasoning. ### Problem Background With the development of deep - learning technology, large - scale language models (LLMs) have performed excellently in natural - language - processing tasks and are widely used in complex real - world scenarios. However, LLMs still have limitations in factual tasks, especially in tasks that require domain - specific knowledge, real - time updated information, and proprietary knowledge. To solve this problem, researchers have developed the Retrieval - Augmented Generation (RAG) method to enhance the performance of LLMs by retrieving external knowledge. ### Existing Problems Although the RAG method performs well in many knowledge - based question - answering tasks, it faces challenges in knowledge - intensive reasoning tasks. Such tasks usually require a large amount of useful information, and this information is scattered in multiple documents. For example, when analyzing financial reports, LLMs need to extract relevant financial indicators from multiple documents and conduct comprehensive comparison and integrated analysis. However, the standard RAG method is difficult to accurately retrieve all relevant text fragments and integrate multiple key information for reasoning, resulting in poor performance. ### Solution To solve the above problems, the paper proposes the StructRAG framework, which constructs and utilizes structured knowledge through a hybrid information structuring mechanism to meet the task requirements in the most appropriate format. Specifically, StructRAG contains three modules: 1. **Hybrid Structure Router**: Select the optimal structure type according to the task requirements. 2. **Scattered Knowledge Structurizer**: Convert the original document into structured knowledge of the selected structure type. 3. **Structured Knowledge Utilizer**: Decompose complex problems, accurately extract knowledge, and infer the final answer. In this way, StructRAG can more effectively handle knowledge - intensive reasoning tasks, especially in cases where information is scattered, significantly improving the performance of the model. ### Experimental Results The experimental results show that StructRAG has achieved state - of - the - art performance in various knowledge - intensive reasoning tasks, especially when the task complexity increases, the performance improvement is more obvious. Compared with other baseline methods, StructRAG not only performs well in the Loong benchmark test but also achieves the best average performance in the Podcast Transcripts task. In conclusion, this paper aims to solve the deficiencies of existing RAG methods in knowledge - intensive reasoning tasks through the StructRAG framework and provides a more effective solution.