EfficientRAG: Efficient Retriever for Multi-Hop Question Answering

Ziyuan Zhuang,Zhiyang Zhang,Sitao Cheng,Fangkai Yang,Jia Liu,Shujian Huang,Qingwei Lin,Saravan Rajmohan,Dongmei Zhang,Qi Zhang
2024-09-26
Abstract:Retrieval-augmented generation (RAG) methods encounter difficulties when addressing complex questions like multi-hop queries. While iterative retrieval methods improve performance by gathering additional information, current approaches often rely on multiple calls of large language models (LLMs). In this paper, we introduce EfficientRAG, an efficient retriever for multi-hop question answering. EfficientRAG iteratively generates new queries without the need for LLM calls at each iteration and filters out irrelevant information. Experimental results demonstrate that EfficientRAG surpasses existing RAG methods on three open-domain multi-hop question-answering datasets.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the issues encountered in multi-hop question answering, particularly the challenges faced by traditional Retrieval-Augmented Generation (RAG) methods when dealing with complex questions. Specifically, the paper proposes EfficientRAG, an efficient multi-hop question answering retriever that can generate new queries during each iteration without invoking large language models (LLMs) and filter out irrelevant information. The goal of EfficientRAG is to improve the performance of multi-hop question answering tasks while reducing latency and cost. Experimental results on three open-domain multi-hop question answering datasets show that EfficientRAG outperforms existing RAG methods, significantly reducing the number of fragments that need to be retrieved while ensuring high recall rates, and demonstrating good cross-domain adaptability.