Abstract:While NMT has achieved remarkable results in the last 5 years, production systems come with strict quality requirements in arbitrarily niche domains that are not always adequately covered by readily available parallel corpora. This is typically addressed by training domain specific models, using fine-tuning methods and some variation of back-translation on top of in-domain monolingual corpora. However, industrial practitioners can rarely afford to focus on a single domain. A far more typical scenario includes a set of closely related, yet succinctly different sub-domains. At <a class="link-external link-http" href="http://Booking.com" rel="external noopener nofollow">this http URL</a>, we need to translate property descriptions, user reviews, as well as messages, (for example those sent between a customer and an agent or property manager). An editor might need to translate articles across a set of different topics. An e-commerce platform would typically need to translate both the description of each item and the user generated content related to them. To this end, we propose MDT: a novel method to simultaneously fine-tune on several sub-domains by passing multidimensional sentence-level information to the model during training and inference. We show that MDT achieves results competitive to N specialist models each fine-tuned on a single constituent domain, while effectively serving all N sub-domains, therefore cutting development and maintenance costs by the same factor. Besides BLEU (industry standard automatic evaluation metric known to only weakly correlate with human judgement) we also report rigorous human evaluation results for all models and sub-domains as well as specific examples that better contextualise the performance of each model in terms of adequacy and fluency. To facilitate further research, we plan to make the code available upon acceptance.

Tag-less Back-Translation

Tagged Back-Translation

Enhanced back-translation for low resource neural machine translation using self-training

Investigating Backtranslation in Neural Machine Translation

Back-Translation Sampling by Targeting Difficult Words in Neural Machine Translation

A Hybrid Approach for Improved Low Resource Neural Machine Translation using Monolingual Data

Multi-Domain Adaptation in Neural Machine Translation Through Multidimensional Tagging

Exploiting Monolingual Data at Scale for Neural Machine Translation.

Handling Syntactic Divergence in Low-resource Machine Translation

Semi-Supervised Neural Machine Translation Via Marginal Distribution Estimation

Tied Transformers: Neural Machine Translation with Shared Encoder and Decoder

Exploiting Reverse Target-Side Contexts for Neural Machine Translation Via Asynchronous Bidirectional Decoding

Domain, Translationese and Noise in Synthetic Data for Neural Machine Translation

Hindi to English: Transformer-Based Neural Machine Translation

Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

On Synthetic Data for Back Translation

Bidirectional Boost: On Improving Tibetan-Chinese Neural Machine Translation With Back-Translation and Self-Learning

Phrase-Based & Neural Unsupervised Machine Translation

Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation

High-Quality Data Augmentation for Low-Resource NMT: Combining a Translation Memory, a GAN Generator, and Filtering

On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation