Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model.

Haiyang Wu,Daxiang Dong,Xiaoguang Hu,Dianhai Yu,Wei He,Hua Wu,Haifeng Wang,Ting Liu
DOI: https://doi.org/10.3115/v1/d14-1015
2014-01-01
Abstract:We investigate how to improve bilingual embedding which has been successfully used as a feature in phrase-based statistical machine translation (SMT). Despite bilingual embedding’s success, the contextual information, which is of critical importance to translation quality, was ignored in previous work. To employ the contextual information, we propose a simple and memory-efficient model for learning bilingual embedding, taking both the source phrase and context around the phrase into account. Bilingual translation scores generated from our proposed bilingual embedding model are used as features in our SMT system. Experimental results show that the proposed method achieves significant improvements on large-scale Chinese-English translation task.
What problem does this paper attempt to address?