Cross-lingual Supervision Improves Unsupervised Neural Machine Translation

Mingxuan Wang,Hongxiao Bai,Hai Zhao,Lei Li
DOI: https://doi.org/10.18653/v1/2021.naacl-industry.12
2020-01-01
Abstract:We propose to improve unsupervised neural machine translation with cross-lingual supervision (CUNMT), which utilizes supervision signals from high resource language pairs to improve the translation of zero-source languages. Specifically, for training En-Ro system without parallel corpus, we can leverage the corpus from En-Fr and En-De to collectively train the translation from one language into many languages under one model. Simple and effective, CUNMT significantly improves the translation quality with a big margin in the benchmark unsupervised translation tasks, and even achieves comparable performance to supervised NMT. In particular, on WMT'14 En-Fr tasks CUNMT achieves 37.6 and 35.18 BLEU score, which is very close to the large scale supervised setting and on WMT'16 EnRo tasks CUNMT achieves 35.09 BLEU score which is even better than the supervised Transformer baseline.
What problem does this paper attempt to address?