Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning

Rongxiang Weng, Qiang Wang, Wensen Cheng, Changfeng Zhu, Min Zhang

2023-09-19

Abstract: Neural machine translation (NMT) has achieved remarkable success in producing high-quality translations. However, current NMT systems suffer from a lack of reliability, as their outputs are often affected by lexical or syntactic changes in inputs, resulting in large variations in quality. This limitation hinders the practicality and trustworthiness of NMT. A contributing factor to this problem is that NMT models trained with the one-to-one paradigm struggle to handle the source diversity phenomenon, where inputs with the same meaning can be expressed differently. In this work, we treat this problem as a bilevel optimization problem and present a consistency-aware meta-learning (CAML) framework derived from the model-agnostic meta-learning (MAML) algorithm to address it. Specifically, the NMT model with CAML (named CoNMT) first learns a consistent meta representation of semantically equivalent sentences in the outer loop. Subsequently, a mapping from the meta representation to the output sentence is learned in the inner loop, allowing the NMT model to translate semantically equivalent sentences to the same target sentence. We conduct experiments on the NIST Chinese to English task, three WMT translation tasks, and the TED M2O task. The results demonstrate that CoNMT effectively improves overall translation quality and reliably handles diverse inputs.

Computation and Language

What problem does this paper attempt to address?

The paper aims to address the issue of reliability in Neural Machine Translation (NMT) systems, particularly focusing on the challenge posed by the source diversity phenomenon. The source diversity problem refers to the fact that inputs with the same meaning can be expressed differently, leading to variations in the quality of translations produced by NMT models. Current NMT systems struggle with this issue because they are typically trained using a one-to-one paradigm, which does not handle the many-to-many nature of natural language well.

To tackle this problem, the authors propose a Consistency-Aware Meta-Learning (CAML) framework. CAML is designed to improve the reliability of NMT by enabling the model to learn a consistent meta-representation from semantically equivalent sentences. The framework consists of two main components:

1. **Learning a Consistent Meta Representation**: In the outer loop, CAML trains the NMT model to learn a consistent meta-representation of semantically equivalent sentences. This is achieved through two training objectives:
   - Sentence-Level Objective: Forces semantically equivalent sentences to generate each other through partial reconstruction.
   - Word-Level Objective: Constrains the output distributions from semantically equivalent sentences to be similar at each decoding step.
2. **Mapping from Meta Representation to Output Sentence**: In the inner loop, the NMT model learns to map the meta-representation to the target sentence, allowing for more consistent and reliable translations.
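The word-level objective can be illustrated with a small sketch. The summary above does not state which divergence the paper uses, so the symmetric KL divergence here is an assumption chosen for illustration, and the names `softmax`, `kl_divergence`, and `word_level_consistency` are hypothetical helpers, not the authors' code. Each input is a list of per-step logit vectors produced when decoding from one of two semantically equivalent source sentences.

```python
import math


def softmax(logits):
    """Convert one decoding step's logits into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two distributions over the same vocabulary."""
    return sum(pi * (math.log(pi + eps) - math.log(qi + eps))
               for pi, qi in zip(p, q))


def word_level_consistency(logits_a, logits_b):
    """Average symmetric KL between the per-step output distributions.

    Assumption: symmetric KL stands in for whatever similarity constraint
    the paper applies; both inputs must cover the same decoding steps.
    """
    if not logits_a or len(logits_a) != len(logits_b):
        raise ValueError("inputs must be non-empty and have equal step counts")
    per_step = []
    for step_a, step_b in zip(logits_a, logits_b):
        p, q = softmax(step_a), softmax(step_b)
        per_step.append(0.5 * (kl_divergence(p, q) + kl_divergence(q, p)))
    return sum(per_step) / len(per_step)
```

A value of zero means the two inputs induce identical distributions at every decoding step; larger values indicate that the paraphrases pull the decoder toward different target words. In training, a term of this kind would presumably be added to the usual translation loss, though the weighting is not specified in this summary.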
