Adversarial Domain Adaptation for Cross-lingual Information Retrieval with Multilingual BERT

Runchuan Wang, Zhao Zhang, Fuzhen Zhuang, Dehong Gao, Yi Wei, Qing He
DOI: https://doi.org/10.1145/3459637.3482050
2021-01-01
Abstract: Transformer-based language models (e.g., BERT, RoBERTa, GPT) have shown remarkable performance in many natural language processing tasks, and their multilingual variants make it easier to handle cross-lingual tasks without using a machine translation system. In this paper, we apply multilingual BERT to the cross-lingual information retrieval (CLIR) task, using a triplet loss to learn the relevance between queries and documents written in different languages. Moreover, we align the token embeddings from different languages via adversarial networks to help the language model learn cross-lingual sentence representations. We achieve state-of-the-art results on the newly published CLIR dataset CLIRMatrix. Furthermore, we show that the adversarial multilingual BERT can also achieve competitive results in the zero-shot setting for some languages, that is, when no CLIR training data is available in the target language.
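As a rough illustration of the triplet loss mentioned in the abstract, the sketch below scores a query against a relevant and a non-relevant document and penalizes the model when the relevant one does not win by a margin. The abstract does not give the paper's exact formulation, so the cosine-similarity scoring, the margin value, and the function names `cosine` and `triplet_loss` are assumptions made for this example; the plain Python lists stand in for sentence embeddings that a multilingual encoder would produce.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_loss(query, pos_doc, neg_doc, margin=0.5):
    """Hinge-style triplet loss on similarity scores (assumed form).

    The loss is zero once the relevant document scores at least
    `margin` higher than the non-relevant one.
    """
    pos_score = cosine(query, pos_doc)
    neg_score = cosine(query, neg_doc)
    return max(0.0, margin - pos_score + neg_score)


# Toy embeddings: the query points in the same direction as the relevant doc.
query = [1.0, 0.0]
relevant = [1.0, 0.0]
irrelevant = [0.0, 1.0]

well_ranked = triplet_loss(query, relevant, irrelevant)
badly_ranked = triplet_loss(query, irrelevant, relevant)
```

In this toy case the correctly ranked triplet incurs no loss, while swapping the two documents yields a positive loss that a training step would push down.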