Source Segment Encoding for Neural Machine Translation.

Qiang Wang, Tong Xiao, Jingbo Zhu
DOI: https://doi.org/10.1007/978-3-319-99495-6_28
2018-01-01
Abstract: Sequential word encoding lacks explicit representations of structural dependencies (e.g., trees or segments) over the source words in neural machine translation. Instead of using source syntax, in this paper we propose a source segment encoding (SSE) approach that models source segments during the encoding process, using two methods. One is to encode off-the-shelf n-grams of the source sentence into the original source memory. The other is to jointly learn an optimal segmentation model with the translation model in an end-to-end manner, without any segmentation supervision. Experimental results show that the SSE method yields an improvement of 2.1+ BLEU points over the baselines on the Chinese-English translation task.
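As a rough illustration of the first method, the sketch below enumerates the contiguous n-grams of a tokenized source sentence, which are the candidate segments that would then be encoded alongside the word-level states. The abstract does not say how segments are encoded or which n-gram orders are used, so the function name `extract_ngram_segments` and the `min_n`/`max_n` defaults are assumptions made for illustration, and the encoding step itself is not shown.

```python
from typing import List, Tuple

Segment = Tuple[int, int, Tuple[str, ...]]  # (start index, end index exclusive, words)


def extract_ngram_segments(tokens: List[str], min_n: int = 2, max_n: int = 3) -> List[Segment]:
    """Enumerate all contiguous n-grams with min_n <= n <= max_n.

    Hypothetical helper: the n-gram orders are illustrative defaults,
    not values taken from the paper.
    """
    segments: List[Segment] = []
    for n in range(min_n, max_n + 1):
        # A sentence of length L has L - n + 1 n-grams (none if n > L).
        for start in range(len(tokens) - n + 1):
            end = start + n
            segments.append((start, end, tuple(tokens[start:end])))
    return segments


if __name__ == "__main__":
    sentence = ["we", "propose", "segment", "encoding"]
    for start, end, words in extract_ngram_segments(sentence):
        print(start, end, " ".join(words))
```

Each returned span keeps its start and end positions, so a downstream encoder could, under whatever pooling or composition scheme is chosen, map a segment back to the word states it covers.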