Neural Machine Translation with Attention Based on a New Syntactic Branch Distance

Ru Peng, Zhitao Chen, Tianyong Hao, Yi Fang
DOI: https://doi.org/10.1007/978-981-15-1721-1_5
2019-01-01
Abstract: Attention mechanisms have been shown to improve the quality of neural machine translation by selectively focusing on a subset of the words in a source sentence during translation. Existing attention mechanisms usually implement local attention using solely the linear index distance between words, ignoring the syntactic structure of the sentence. In this paper, we extend local attention with a syntactic distance constraint and propose an attention mechanism based on a new syntactic branch distance, which simultaneously attends to words that are close in linear index distance and to syntactically related words. On the English-to-German translation task, experimental results show that our model outperforms a recent baseline method by 1.61 BLEU points, demonstrating the effectiveness of the proposed model.