Abstract:Retrieval Augmented Generation (RAG) has greatly improved the performance of Large Language Model (LLM) responses by grounding generation with context from existing documents. These systems work well when documents are clearly relevant to a question context. But what about when a document has partial information, or less obvious connections to the context? And how should we reason about connections between documents? In this work, we seek to answer these two core questions about RAG generation. We introduce G-RAG, a reranker based on graph neural networks (GNNs) between the retriever and reader in RAG. Our method combines both connections between documents and semantic information (via Abstract Meaning Representation graphs) to provide a context-informed ranker for RAG. G-RAG outperforms state-of-the-art approaches while having smaller computational footprint. Additionally, we assess the performance of PaLM 2 as a reranker and find it to significantly underperform G-RAG. This result emphasizes the importance of reranking for RAG even when using Large Language Models.

What problem does this paper attempt to address?

### Problems Addressed by the Paper The paper primarily focuses on improving the performance of Retrieval-Augmented Generation (RAG) in Open-Domain Question Answering (ODQA). Specifically, the paper addresses the following points: 1. **Inter-Document Connections**: - Existing RAG systems excel in retrieving relevant documents but fall short in leveraging the connections between documents. - This leads to the system easily overlooking documents that contain answers but are not obviously related to the question's context. 2. **Reranking**: - The reranking process is crucial for effectively filtering the retrieved documents. - A robust reranking mechanism helps better identify relevant documents and elevate their ranking positions, thereby improving precision matching performance metrics. 3. **Introduction of Graph Neural Networks (GNN)**: - The G-RAG method is proposed, which utilizes Graph Neural Networks to combine inter-document connection information and semantic information (through Abstract Meaning Representation Graphs, AMR) to provide a context-based reranking mechanism. - G-RAG outperforms existing methods while having lower computational overhead. 4. **Evaluating the Performance of Large Language Models (LLM) as Rerankers**: - The performance of large language models like PaLM 2 as rerankers was evaluated, and it was found that their performance is significantly inferior to G-RAG. - This emphasizes that even with large language models, the design of the reranking mechanism remains crucial. ### Summary This paper aims to improve the performance of RAG systems in open-domain question answering by introducing Graph Neural Networks and optimizing the reranking mechanism, particularly in handling inter-document connections and enhancing the accuracy of relevant document identification.

Don't Forget to Connect! Improving RAG with Graph-based Reranking

Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

GRAG: Graph Retrieval-Augmented Generation

GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning

EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations

Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs

Graph Neural Network Enhanced Retrieval for Question Answering of LLMs

Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers

GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation

Toward Optimal Search and Retrieval for RAG

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

LightRAG: Simple and Fast Retrieval-Augmented Generation

Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems

Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation

Retrieval-Augmented Generation for Large Language Models: A Survey

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation

ARAGOG: Advanced RAG Output Grading

Better RAG using Relevant Information Gain

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

FunnelRAG: A Coarse-to-Fine Progressive Retrieval Paradigm for RAG