Enhancing the RAG Retrieval Engine Through Multi-Encoder Fusion

Huaiyuan He,Wenzhuo Shen,Tiyan Shen,Jie Shen
DOI: https://doi.org/10.1109/ICECAI62591.2024.10674962
2024-05-31
Abstract:With the rapid advancement of large language model technology, Retriever-Augmented Generation (RAG), which is based on vector databases, has demonstrated extensive potential for application. This paper focuses on the text retrieval phase of RAG, enhancing text recall by introducing multiple text encoders. Consequently, this paper designs a neural network model and loss function for multi-encoder fusion and determines the optimal weight allocation for multi-encoder fusion through neural network training. Experimental results demonstrate that the predicted fusion weights from the neural network trained in this paper can be closely aligned with the optimal fusion weights. By using the predicted BCE and BGE encoding model weights, an improvement of 6% in Mean Reciprocal Rank (MRR) is achieved.
Computer Science
What problem does this paper attempt to address?