Abstract:While NMT has achieved remarkable results in the last 5 years, production systems come with strict quality requirements in arbitrarily niche domains that are not always adequately covered by readily available parallel corpora. This is typically addressed by training domain specific models, using fine-tuning methods and some variation of back-translation on top of in-domain monolingual corpora. However, industrial practitioners can rarely afford to focus on a single domain. A far more typical scenario includes a set of closely related, yet succinctly different sub-domains. At <a class="link-external link-http" href="http://Booking.com" rel="external noopener nofollow">this http URL</a>, we need to translate property descriptions, user reviews, as well as messages, (for example those sent between a customer and an agent or property manager). An editor might need to translate articles across a set of different topics. An e-commerce platform would typically need to translate both the description of each item and the user generated content related to them. To this end, we propose MDT: a novel method to simultaneously fine-tune on several sub-domains by passing multidimensional sentence-level information to the model during training and inference. We show that MDT achieves results competitive to N specialist models each fine-tuned on a single constituent domain, while effectively serving all N sub-domains, therefore cutting development and maintenance costs by the same factor. Besides BLEU (industry standard automatic evaluation metric known to only weakly correlate with human judgement) we also report rigorous human evaluation results for all models and sub-domains as well as specific examples that better contextualise the performance of each model in terms of adequacy and fluency. To facilitate further research, we plan to make the code available upon acceptance.

Continuous Learning from Human Post-Edits for Neural Machine Translation

Continual Learning for Neural Machine Translation

Online Learning for Neural Machine Translation Post-editing

A User-Study on Online Adaptation of Neural Machine Translation to Human Post-Edits

Continuous Learning in Neural Machine Translation using Bilingual Dictionaries

Non-Parametric Online Learning from Human Feedback for Neural Machine Translation

Learn and Consolidate: Continual Adaptation for Zero-Shot and Multilingual Neural Machine Translation.

Learning to Refine Source Representations for Neural Machine Translation

Computer Assisted Translation with Neural Quality Estimation and Automatic Post-Editing

Can Neural Machine Translation be Improved with User Feedback?

Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation

Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation

Improving Neural Machine Translation Models with Monolingual Data

Learning Non-Monotonic Automatic Post-Editing of Translations from Human Orderings

Multi-Domain Adaptation in Neural Machine Translation Through Multidimensional Tagging

Leveraging GPT-4 for Automatic Translation Post-Editing

Learning to Remember Translation History with a Continuous Cache.

Towards Decoding as Continuous Optimization in Neural Machine Translation

Improving Multilingual Translation by Representation and Gradient Regularization

On Instruction-Finetuning Neural Machine Translation Models

Reinforcement Learning based Curriculum Optimization for Neural Machine Translation