Refiner: Restructure Retrieved Content Efficiently to Advance Question-Answering Capabilities

Zhonghao Li, Xuming Hu, Aiwei Liu, Kening Zheng, Sirui Huang, Hui Xiong
DOI: https://doi.org/10.18653/v1/2024.findings-emnlp.500
2024-01-01
Abstract: Large Language Models (LLMs) are limited by their parametric knowledge, leading to hallucinations in knowledge-extensive tasks. To address this, Retrieval-Augmented Generation (RAG) incorporates external document chunks to expand LLM knowledge. Furthermore, compressing information from document chunks through extraction or summarization can improve LLM performance. Nonetheless, LLMs still struggle to notice and utilize scattered key information, a problem known as the "lost-in-the-middle" syndrome. Therefore, we typically need to restructure the content for LLMs to recognize the key information. We propose Refiner, an end-to-end extract-and-restructure paradigm that operates in the post-retrieval process of RAG. Refiner leverages a single decoder-only LLM to adaptively extract query-relevant contents verbatim along with the necessary context, and to section them based on their interconnectedness, thereby highlighting information distinction and aligning downstream LLMs with the original context effectively. Experiments show that a trained Refiner (with 7B parameters) yields significant gains in downstream LLM answer accuracy, and outperforms other state-of-the-art advanced RAG and concurrent compressing approaches in various single-hop and multi-hop QA tasks. Notably, Refiner achieves an 80.5% token reduction and a 1.6-7.0% improvement margin in multi-hop tasks compared to the next best solution. Refiner is a plug-and-play solution that can be seamlessly integrated with RAG systems, facilitating its application across diverse open-source frameworks.
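
To make the extract-and-restructure idea concrete, the following is a minimal sketch of where such a step sits in a RAG pipeline: after retrieval, before the downstream LLM. It is not the authors' implementation. The actual Refiner is a trained 7B decoder-only LLM, whereas this toy stand-in selects sentences by simple keyword overlap. The function names (toy_refine, build_prompt), the stopword list, the "section.item" numbering, and the choice to group kept sentences by their source chunk are all assumptions made for illustration.

```python
import re

# Hypothetical stopword list, used only so that function words in the
# query do not match every sentence.
STOPWORDS = {"the", "a", "an", "of", "in", "is", "was", "who", "what",
             "when", "where", "did", "to", "and"}


def _words(text):
    """Lowercased alphanumeric tokens of a string, as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def toy_refine(query, chunks):
    """Toy stand-in for a refiner model (assumption, not the paper's method).

    Keeps, verbatim, every sentence that shares at least one content word
    with the query, and groups the kept sentences into numbered sections,
    one section per source chunk that contributed anything.
    """
    query_terms = _words(query) - STOPWORDS
    sections = []
    for chunk in chunks:
        # Naive sentence split on whitespace after ., ! or ?
        sentences = re.split(r"(?<=[.!?])\s+", chunk.strip())
        kept = [s for s in sentences if _words(s) & query_terms]
        if kept:
            sections.append(kept)

    lines = []
    for i, kept in enumerate(sections, start=1):
        for j, sentence in enumerate(kept, start=1):
            lines.append(f"{i}.{j} {sentence}")
    return "\n".join(lines)


def build_prompt(query, refined):
    """Assemble the shortened, sectioned context into a downstream prompt."""
    return f"Context:\n{refined}\n\nQuestion: {query}\nAnswer:"


if __name__ == "__main__":
    retrieved = [
        "Paris is the capital of France. It hosts the Louvre.",
        "Bananas are yellow.",
        "France borders Spain. The capital moved briefly to Vichy.",
    ]
    question = "What is the capital of France?"
    print(build_prompt(question, toy_refine(question, retrieved)))
```

Because the refining step only consumes retrieved chunks and emits text, it can be swapped in or out without touching the retriever or the downstream model, which is the plug-and-play property the abstract describes.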