Abstract:While NMT has achieved remarkable results in the last 5 years, production systems come with strict quality requirements in arbitrarily niche domains that are not always adequately covered by readily available parallel corpora. This is typically addressed by training domain specific models, using fine-tuning methods and some variation of back-translation on top of in-domain monolingual corpora. However, industrial practitioners can rarely afford to focus on a single domain. A far more typical scenario includes a set of closely related, yet succinctly different sub-domains. At <a class="link-external link-http" href="http://Booking.com" rel="external noopener nofollow">this http URL</a>, we need to translate property descriptions, user reviews, as well as messages, (for example those sent between a customer and an agent or property manager). An editor might need to translate articles across a set of different topics. An e-commerce platform would typically need to translate both the description of each item and the user generated content related to them. To this end, we propose MDT: a novel method to simultaneously fine-tune on several sub-domains by passing multidimensional sentence-level information to the model during training and inference. We show that MDT achieves results competitive to N specialist models each fine-tuned on a single constituent domain, while effectively serving all N sub-domains, therefore cutting development and maintenance costs by the same factor. Besides BLEU (industry standard automatic evaluation metric known to only weakly correlate with human judgement) we also report rigorous human evaluation results for all models and sub-domains as well as specific examples that better contextualise the performance of each model in terms of adequacy and fluency. To facilitate further research, we plan to make the code available upon acceptance.

Finding Sparse Structures for Domain Specific Neural Machine Translation

Learning Domain Specific Sub-layer Latent Variable for Multi-Domain Adaptation Neural Machine Translation

Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation

Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation

Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation

Structure-aware Domain Knowledge Injection for Large Language Models

Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks

Multi-Domain Adaptation in Neural Machine Translation Through Multidimensional Tagging

Compact Personalized Models for Neural Machine Translation

Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation

Building a Multi-domain Neural Machine Translation Model using Knowledge Distillation

Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

NutePrune: Efficient Progressive Pruning with Numerous Teachers for Large Language Models

Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation

Exploiting Monolingual Data at Scale for Neural Machine Translation.

NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Dissecting Lottery Ticket Transformers: Structural and Behavioral Study of Sparse Neural Machine Translation

A Scenario-Generic Neural Machine Translation Data Augmentation Method

Bridging the Domain Gap: Improve Informal Language Translation Via Counterfactual Domain Adaptation

Vocabulary Adaptation for Distant Domain Adaptation in Neural Machine Translation