Fine-tune the Entire RAG Architecture (including DPR retriever) for Question-Answering

Shamane Siriwardhana,Rivindu Weerasekera,Elliott Wen,Suranga Nanayakkara
DOI: https://doi.org/10.48550/arXiv.2106.11517
2021-06-22
Abstract:In this paper, we illustrate how to fine-tune the entire Retrieval Augment Generation (RAG) architecture in an end-to-end manner. We highlighted the main engineering challenges that needed to be addressed to achieve this objective. We also compare how end-to-end RAG architecture outperforms the original RAG architecture for the task of question answering. We have open-sourced our implementation in the HuggingFace Transformers library.
Information Retrieval,Artificial Intelligence,Computation and Language
What problem does this paper attempt to address?