Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation

Guanhua Chen,Wenhan Yu,Lei Sha
2024-04-19
Abstract:While Retrieval-Augmented Generation (RAG) plays a crucial role in the application of Large Language Models (LLMs), existing retrieval methods in knowledge-dense domains like law and medicine still suffer from a lack of multi-perspective views, which are essential for improving interpretability and reliability. Previous research on multi-view retrieval often focused solely on different semantic forms of queries, neglecting the expression of specific domain knowledge perspectives. This paper introduces a novel multi-view RAG framework, MVRAG, tailored for knowledge-dense domains that utilizes intention-aware query rewriting from multiple domain viewpoints to enhance retrieval precision, thereby improving the effectiveness of the final inference. Experiments conducted on legal and medical case retrieval demonstrate significant improvements in recall and precision rates with our framework. Our multi-perspective retrieval approach unleashes the potential of multi-view information enhancing RAG tasks, accelerating the further application of LLMs in knowledge-intensive fields.
Computation and Language
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of multi-view information retrieval in Retrieval-Augmented Generation (RAG) within knowledge-intensive domains such as law and medicine. Currently, most RAG frameworks primarily rely on vector-based similarity measures for information retrieval, neglecting the intrinsic relationships and local similarities between the query and the retrieved information, resulting in suboptimal retrieval performance in specialized fields. Specifically: 1. **Lack of Multi-View Information**: - Current RAG methods in knowledge-intensive fields like law and medicine fail to adequately consider multi-view professional information, leading to imprecise, misleading, or lacking critical information in retrieval results. 2. **Limitations of Single Semantic Form**: - Previous research on multi-view retrieval has mainly focused on query rewriting in different semantic forms, neglecting the expression of domain-specific knowledge perspectives, thus failing to effectively improve retrieval accuracy. To address these issues, the paper proposes a new Multi-View RAG framework (MVRAG), which enhances retrieval accuracy from multiple domain perspectives through intent-aware query rewriting, thereby improving the effectiveness of final reasoning. Experimental results show that this framework significantly improves recall and precision in legal and medical case retrieval tasks. By introducing multi-view information, MVRAG can better capture the complex relationships between the query and the retrieved information, enhancing the application effectiveness of RAG systems in knowledge-intensive domains.