Abstract:Based on a unified encoder-decoder framework with attentional mechanism, neural machine translation (NMT) models have attracted much attention and become the mainstream in the community of machine translation. Generally, the NMT decoders produce translation in a left-to-right way. As a result, only left-to-right target-side contexts from the generated translations are exploited, while the right-to-left target-side contexts are completely unexploited for translation. In this paper, we extend the conventional attentional encoder-decoder NMT framework by introducing a backward decoder, in order to explore asynchronous bidirectional decoding for NMT. In the first step after encoding, our backward decoder learns to generate the target-side hidden states in a right-to-left manner. Next, in each timestep of translation prediction, our forward decoder concurrently considers both the source-side and the reverse target-side hidden states via two attention models. Compared with previous models, the innovation in this architecture enables our model to fully exploit contexts from both source side and target side, which improve translation quality altogether. We conducted experiments on NIST Chinese-English, WMT English-German and Finnish-English translation tasks to investigate the effectiveness of our model. Experimental results show that (1) our improved RNN-based NMT model achieves significant improvements over the conventional RNNSearch by 1.44/-3.02, 1.11/-1.01, and 1.23/-1.27 average BLEU and TER points, respectively; and (2) our enhanced Transformer outperforms the standard Transformer by 1.56/-1.49, 1.76/-2.49, and 1.29/-1.33 average BLEU and TER points, respectively. We released our code at https://github.com/DeepLearnXMU/ABD-NMT.

Promoting Target Data in Context-aware Neural Machine Translation

Diving Deep into Context-Aware Neural Machine Translation

Selective Attention for Context-aware Neural Machine Translation

Context-Adaptive Document-Level Neural Machine Translation

Towards Making the Most of Context in Neural Machine Translation

Exploiting Cross-Sentence Context for Neural Machine Translation

A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

Context-Aware Learning for Neural Machine Translation

Measuring and Increasing Context Usage in Context-Aware Machine Translation

Improving Neural Machine Translation with Pre-trained Representation

Learning Contextualized Sentence Representations for Document-Level Neural Machine Translation

Exploiting Reverse Target-Side Contexts for Neural Machine Translation Via Asynchronous Bidirectional Decoding

Context Gates for Neural Machine Translation

When a Good Translation is Wrong in Context: Context-Aware Machine Translation Improves on Deixis, Ellipsis, and Lexical Cohesion

Neural Machine Translation with Extended Context

Predicting Target Language CCG Supertags Improves Neural Machine Translation

Exploiting Monolingual Data at Scale for Neural Machine Translation.

Contrastive Learning for Context-aware Neural Machine TranslationUsing Coreference Information

Context-Aware Cross-Attention for Non-Autoregressive Translation

Using Whole Document Context in Neural Machine Translation

HanoiT: Enhancing Context-aware Translation via Selective Context