Cross-Lingual Training of Dense Retrievers for Document Retrieval

Peng Shi, Rui Zhang, Richard He Bai, Jimmy J. Lin
DOI: https://doi.org/10.18653/v1/2021.mrl-1.24
2021-01-01
Abstract: Dense retrieval has shown great success for passage ranking in English. However, its effectiveness for non-English languages remains unexplored because training resources are limited. In this work, we explore different techniques for transferring document ranking from English annotations to non-English languages. Our experiments reveal that zero-shot model-based transfer using mBERT improves search quality. We find that weakly-supervised target language transfer is competitive with generation-based target language transfer, which requires translation models.