Abstract:Retrieval-augmented generation (RAG) is a key means to effectively enhance large language models (LLMs) in many knowledge-based tasks. However, existing RAG methods struggle with knowledge-intensive reasoning tasks, because useful information required to these tasks are badly scattered. This characteristic makes it difficult for existing RAG methods to accurately identify key information and perform global reasoning with such noisy augmentation. In this paper, motivated by the cognitive theories that humans convert raw information into various structured knowledge when tackling knowledge-intensive reasoning, we proposes a new framework, StructRAG, which can identify the optimal structure type for the task at hand, reconstruct original documents into this structured format, and infer answers based on the resulting structure. Extensive experiments across various knowledge-intensive tasks show that StructRAG achieves state-of-the-art performance, particularly excelling in challenging scenarios, demonstrating its potential as an effective solution for enhancing LLMs in complex real-world applications.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the challenges faced by existing Retrieval - Augmented Generation (RAG) methods when dealing with knowledge - intensive reasoning tasks. Specifically, existing RAG methods are difficult to effectively handle these tasks because the required relevant information is scattered in a large number of documents, making it difficult for the model to accurately identify key information and perform global reasoning. ### Problem Background With the development of deep - learning technology, large - scale language models (LLMs) have performed excellently in natural - language - processing tasks and are widely used in complex real - world scenarios. However, LLMs still have limitations in factual tasks, especially in tasks that require domain - specific knowledge, real - time updated information, and proprietary knowledge. To solve this problem, researchers have developed the Retrieval - Augmented Generation (RAG) method to enhance the performance of LLMs by retrieving external knowledge. ### Existing Problems Although the RAG method performs well in many knowledge - based question - answering tasks, it faces challenges in knowledge - intensive reasoning tasks. Such tasks usually require a large amount of useful information, and this information is scattered in multiple documents. For example, when analyzing financial reports, LLMs need to extract relevant financial indicators from multiple documents and conduct comprehensive comparison and integrated analysis. However, the standard RAG method is difficult to accurately retrieve all relevant text fragments and integrate multiple key information for reasoning, resulting in poor performance. ### Solution To solve the above problems, the paper proposes the StructRAG framework, which constructs and utilizes structured knowledge through a hybrid information structuring mechanism to meet the task requirements in the most appropriate format. Specifically, StructRAG contains three modules: 1. **Hybrid Structure Router**: Select the optimal structure type according to the task requirements. 2. **Scattered Knowledge Structurizer**: Convert the original document into structured knowledge of the selected structure type. 3. **Structured Knowledge Utilizer**: Decompose complex problems, accurately extract knowledge, and infer the final answer. In this way, StructRAG can more effectively handle knowledge - intensive reasoning tasks, especially in cases where information is scattered, significantly improving the performance of the model. ### Experimental Results The experimental results show that StructRAG has achieved state - of - the - art performance in various knowledge - intensive reasoning tasks, especially when the task complexity increases, the performance improvement is more obvious. Compared with other baseline methods, StructRAG not only performs well in the Loong benchmark test but also achieves the best average performance in the Podcast Transcripts task. In conclusion, this paper aims to solve the deficiencies of existing RAG methods in knowledge - intensive reasoning tasks through the StructRAG framework and provides a more effective solution.

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning

How Much Can RAG Help the Reasoning of LLM?

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

BioRAG: A RAG-LLM Framework for Biological Question Reasoning

Struct-X: Enhancing Large Language Models Reasoning with Structured Data

Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation

Refiner: Restructure Retrieved Content Efficiently to Advance Question-Answering Capabilities

Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation

Leveraging Structured Information for Explainable Multi-hop Question Answering and Reasoning

WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs

RuAG: Learned-rule-augmented Generation for Large Language Models

KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation

Meta Knowledge for Retrieval Augmented Large Language Models

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering

RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation