Language-aware Interlingua for Multilingual Neural Machine Translation.

Changfeng Zhu,Heng Yu,Shanbo Cheng,Weihua Luo
DOI: https://doi.org/10.18653/v1/2020.acl-main.150
2020-01-01
Abstract:Multilingual neural machine translation (NMT) has led to impressive accuracy improvements in low-resource scenarios by sharing common linguistic information across languages. However, the traditional multilingual model fails to capture the diversity and specificity of different languages, resulting in inferior performance compared with individual models that are sufficiently trained. In this paper, we incorporate a language-aware interlingua into the Encoder-Decoder architecture. The interlingual network enables the model to learn a language-independent representation from the semantic spaces of different languages, while still allowing for language-specific specialization of a particular language-pair. Experiments show that our proposed method achieves remarkable improvements over state-of-the-art multilingual NMT baselines and produces comparable performance with strong individual models.
What problem does this paper attempt to address?