Don't Forget to Connect! Improving RAG with Graph-based Reranking

Jialin Dong, Bahare Fatemi, Bryan Perozzi, Lin F. Yang, Anton Tsitsulin
2024-05-29
Abstract: Retrieval Augmented Generation (RAG) has greatly improved the performance of Large Language Model (LLM) responses by grounding generation with context from existing documents. These systems work well when documents are clearly relevant to a question context. But what about when a document has partial information, or less obvious connections to the context? And how should we reason about connections between documents? In this work, we seek to answer these two core questions about RAG generation. We introduce G-RAG, a reranker based on graph neural networks (GNNs) between the retriever and reader in RAG. Our method combines both connections between documents and semantic information (via Abstract Meaning Representation graphs) to provide a context-informed ranker for RAG. G-RAG outperforms state-of-the-art approaches while having smaller computational footprint. Additionally, we assess the performance of PaLM 2 as a reranker and find it to significantly underperform G-RAG. This result emphasizes the importance of reranking for RAG even when using Large Language Models.
Computation and Language, Artificial Intelligence, Machine Learning, Social and Information Networks
What problem does this paper attempt to address?
### Problems Addressed by the Paper

The paper focuses on improving Retrieval-Augmented Generation (RAG) for Open-Domain Question Answering (ODQA). Specifically, it addresses the following points:

1. **Inter-Document Connections**:
   - Existing RAG systems retrieve relevant documents well but make little use of the connections between those documents.
   - As a result, they can overlook documents that contain the answer but are not obviously related to the question's context.
2. **Reranking**:
   - Reranking is the step that filters and reorders the retrieved documents, so its quality matters.
   - A robust reranker identifies relevant documents and moves them up the ranking, which improves ranking-based performance metrics.
3. **Introduction of Graph Neural Networks (GNN)**:
   - The paper proposes G-RAG, which uses a graph neural network to combine inter-document connections with semantic information (from Abstract Meaning Representation, AMR, graphs) into a context-informed reranker.
   - G-RAG outperforms existing methods while having lower computational overhead.
4. **Evaluating the Performance of Large Language Models (LLM) as Rerankers**:
   - The paper evaluates PaLM 2 as a reranker and finds that it performs significantly worse than G-RAG.
   - This shows that the design of the reranking mechanism remains important even when large language models are available.

### Summary

This paper aims to improve RAG systems for open-domain question answering by introducing a GNN-based reranker that exploits inter-document connections and identifies relevant documents more accurately.
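To make the idea of reranking over inter-document connections concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's implementation: the concept sets stand in for what G-RAG derives from AMR graphs, the feature vectors are hand-written toy embeddings, and a single untrained mean-aggregation step stands in for the trained GNN. The names `build_document_graph`, `propagate`, and `rerank` are hypothetical.

```python
from itertools import combinations


def build_document_graph(doc_concepts):
    """Connect two documents when they share at least one concept.

    Assumption: each document is summarized by a set of concept strings.
    """
    neighbors = {i: set() for i in range(len(doc_concepts))}
    for i, j in combinations(range(len(doc_concepts)), 2):
        if doc_concepts[i] & doc_concepts[j]:
            neighbors[i].add(j)
            neighbors[j].add(i)
    return neighbors


def propagate(features, neighbors):
    """One round of mean aggregation over each node and its neighbors."""
    updated = []
    for i, own in enumerate(features):
        group = [own] + [features[j] for j in sorted(neighbors[i])]
        updated.append([sum(col) / len(group) for col in zip(*group)])
    return updated


def rerank(query_vec, features, neighbors):
    """Order documents by the dot product of the query with smoothed features."""
    smoothed = propagate(features, neighbors)
    scores = [sum(q * x for q, x in zip(query_vec, row)) for row in smoothed]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order, scores


# Toy data (invented for illustration): document 1 looks weakly relevant on
# its own but shares a concept with the strongly relevant document 0.
doc_concepts = [{"paris", "france"}, {"france", "capital"}, {"football"}]
features = [[1.0, 0.0], [0.2, 0.0], [0.3, 0.1]]
query_vec = [1.0, 0.0]

graph = build_document_graph(doc_concepts)
order, scores = rerank(query_vec, features, graph)
print(order, scores)
```

In this toy setup, document 1 would rank below document 2 if scored in isolation, but sharing a concept with document 0 lifts it above the unconnected document. That is the intuition behind using document connections; the method described in the paper learns how to weight such signals rather than averaging them uniformly.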