Improving Bilingual Lexicon Induction on Distant Language Pairs

Wenhao Zhu, Zhihao Zhou, Shujian Huang, Zhenya Lin, Xiangsheng Zhou, Yaofeng Tu, Jiajun Chen
DOI: https://doi.org/10.1007/978-981-15-1721-1_1
2019-01-01
Abstract: Aligning the representation spaces of two languages to induce a bilingual lexicon achieves attractive results on European language pairs. Unfortunately, current solutions perform poorly on distant language pairs. To address this problem, we analyze existing models for the lexicon induction task on distant language pairs, such as English-Chinese. Based on this analysis, we propose a framework for the task with improved preprocessing, mapping, and inference. Experimental results show that our proposed approach substantially improves the accuracy of the induced bilingual lexicons on English-Chinese, as well as on some other distant language pairs.