Abstract:Retrieval-augmented generation (RAG) has revitalized Large Language Models (LLMs) by injecting non-parametric factual knowledge. Compared with long-context LLMs, RAG is considered an effective summarization tool in a more concise and lightweight manner, which can interact with LLMs multiple times using diverse queries to get comprehensive responses. However, the LLM-generated historical responses, which contain potentially insightful information, are largely neglected and discarded by existing approaches, leading to suboptimal results. In this paper, we propose \textit{graph of records} (\textbf{GoR}), which leverages historical responses generated by LLMs to enhance RAG for long-context global summarization. Inspired by the \textit{retrieve-then-generate} paradigm of RAG, we construct a graph by establishing an edge between the retrieved text chunks and the corresponding LLM-generated response. To further uncover the intricate correlations between them, GoR further features a \textit{graph neural network} and an elaborately designed \textit{BERTScore}-based objective for self-supervised model training, enabling seamless supervision signal backpropagation between reference summaries and node embeddings. We comprehensively compare GoR with 12 baselines across four long-context summarization datasets, and the results indicate that our proposed method reaches the best performance e.g., 15\%, 8\%, and 19\% improvement over retrievers w.r.t. Rouge-L, Rouge-1, and Rouge-2 on the WCEP dataset). Extensive experiments further demonstrate the effectiveness of GoR. Code is available at <a class="link-external link-https" href="https://github.com/ulab-uiuc/GoR" rel="external noopener nofollow">this https URL</a>

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is that existing Retrieval - Augmented Generation (RAG) methods fail to fully utilize the historical responses generated by large - language models (LLM) when handling the global summarization task of long - text. Specifically, although these historical responses contain valuable information, current methods often overlook them, resulting in sub - optimal results. To overcome this challenge, the paper proposes the "Graph of Records (GoR)" method, which enhances the performance of RAG in long - text global summarization by constructing a graph structure that connects retrieved text fragments with responses generated by LLM. ### Main contributions: 1. **Proposing "Graph of Records (GoR)"**: Utilize and organize the historical responses generated by LLM as a record graph to strengthen the performance of RAG in long - text global summarization. The research reveals that the fine - grained associations between LLM historical responses and text fragments in long documents can be effectively mined and utilized, thereby improving the performance of RAG. 2. **Utilizing Graph Neural Networks (GNN) and BERTScore - based objective functions**: Design a self - supervised training objective, optimize node embeddings through graph neural networks, enabling them to adaptively reflect the semantics and complex associations of input queries. In addition, the indirect supervision signals from self - supervised labels are crucial for the effective optimization of node embeddings. 3. **Experimental verification**: Evaluate the proposed method on four long - text summarization datasets. The experimental results show that GoR significantly outperforms multiple competing baselines. For example, on the WCEP dataset, GoR improves the Retrieval - Augmented Generation (RAG) by 15%, 8% and 19% on Rouge - L, Rouge - 1 and Rouge - 2 metrics respectively compared to the retriever. ### Specific problems solved: - **Utilization of historical responses**: Existing methods ignore the historical responses generated by LLM, and the information contained in these responses may be very useful for summary generation. GoR fully utilizes this information by constructing a graph structure to associate these historical responses with retrieved text fragments. - **Mining of complex associations**: There are complex associations between text fragments in long documents and responses generated by LLM, and these associations are difficult to capture by traditional semantic similarity methods. GoR effectively mines these associations through graph neural networks and self - supervised training objectives. - **Lack of supervision signals**: Unlike local summarization tasks, global summarization tasks lack clear labels and it is difficult to directly back - propagate effective supervision signals. GoR indirectly provides supervision signals through the BERTScore - based ranking mechanism and optimizes node embeddings. In conclusion, by proposing the "Graph of Records (GoR)" method, this paper solves the problem that existing RAG methods fail to fully utilize historical responses in long - text global summarization tasks, and significantly improves the quality and performance of summaries.

Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs

From Local to Global: A Graph RAG Approach to Query-Focused Summarization

Don't Forget to Connect! Improving RAG with Graph-based Reranking

GRAFT: Graph Retrieval Augmented Fine Tuning for Multi-Hop Query Summarization

Graph Retrieval-Augmented Generation: A Survey

GRAG: Graph Retrieval-Augmented Generation

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

LightRAG: Simple and Fast Retrieval-Augmented Generation

Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation

Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation

Retrieval Augmented Generation for Dynamic Graph Modeling

GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration

Graph Neural Network Enhanced Retrieval for Question Answering of LLMs

Advanced RAG Models with Graph Structures: Optimizing Complex Knowledge Reasoning and Text Generation