Abstract:Multilingual Neural Machine Translation (MNMT) models are commonly trained on a joint set of bilingual corpora which is acutely English-centric (i.e. English either as the source or target language). While direct data between two languages that are non-English is explicitly available at times, its use is not common. In this paper, we first take a step back and look at the commonly used bilingual corpora (WMT), and resurface the existence and importance of implicit structure that existed in it: multi-way alignment across examples (the same sentence in more than two languages). We set out to study the use of multi-way aligned examples to enrich the original English-centric parallel corpora. We reintroduce this direct parallel data from multi-way aligned corpora between all source and target languages. By doing so, the English-centric graph expands into a complete graph, every language pair being connected. We call MNMT with such connectivity pattern complete Multilingual Neural Machine Translation (cMNMT) and demonstrate its utility and efficacy with a series of experiments and analysis. In combination with a novel training data sampling strategy that is conditioned on the target language only, cMNMT yields competitive translation quality for all language pairs. We further study the size effect of multi-way aligned data, its transfer learning capabilities and how it eases adding a new language in MNMT. Finally, we stress test cMNMT at scale and demonstrate that we can train a cMNMT model with up to 111*112=12,432 language pairs that provides competitive translation quality for all language pairs.

Unsupervised Neural Machine Translation with Cross-Lingual Language Representation Agreement

Language Model-Driven Unsupervised Neural Machine Translation

Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation

Unified Model Learning for Various Neural Machine Translation

Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation

Reference Language based Unsupervised Neural Machine Translation

An Empirical study of Unsupervised Neural Machine Translation: analyzing NMT output, model's behavior and sentences' contribution

Phrase-Based & Neural Unsupervised Machine Translation

Unsupervised Transfer Learning in Multilingual Neural Machine Translation with Cross-Lingual Word Embeddings

Crosslingual Embeddings are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study

Unpaired Multimodal Neural Machine Translation via Reinforcement Learning

Machine Translation With Weakly Paired Bilingual Documents

Joint Training for Neural Machine Translation Models with Monolingual Data

Complete Multilingual Neural Machine Translation

UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

Unsupervised Multimodal Machine Translation for Low-Resource Distant Language Pairs

Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

Reciprocal Supervised Learning Improves Neural Machine Translation

Cross-model Back-translated Distillation for Unsupervised Machine Translation

Self-supervised and Supervised Joint Training for Resource-rich Machine Translation

Scrambled Translation Problem: A Problem of Denoising UNMT