Incorporating Structural Alignment Biases into an Attentional Neural Translation Model

Trevor Cohn,Cong Duy Vu Hoang,Ekaterina Vymolova,Kaisheng Yao,Chris Dyer,Gholamreza Haffari
DOI: https://doi.org/10.48550/arXiv.1601.01085
2016-01-06
Abstract:Neural encoder-decoder models of machine translation have achieved impressive results, rivalling traditional translation models. However their modelling formulation is overly simplistic, and omits several key inductive biases built into traditional models. In this paper we extend the attentional neural translation model to include structural biases from word based alignment models, including positional bias, Markov conditioning, fertility and agreement over translation directions. We show improvements over a baseline attentional model and standard phrase-based model over several language pairs, evaluating on difficult languages in a low resource setting.
Computation and Language
What problem does this paper attempt to address?