Improve The Statistical Machine Translation Performance By Refining The Word Alignments

Hongfei Jiang,Tiejun Zhao,Sheng Li,Muyun Yang,Chunyue Zhang
2010-01-01
Abstract:Most of the stat-of-the-art statistical machine translation systems base their translation models on many-to-many word alignments obtained by statistical word aligner. Usually, such word alignments involve many low-quality word alignment links which can violate the phrase extraction. Especially, the low-quality word alignment links in one-to-many correspondence can cause the missing of many useful sub-phrase pairs in the phrase extraction. In this paper, we investigate how to alleviate this situation by eliminating the low quality links. Experiments on various scale datasets show that stable improvements can be obtained by the presented methods.
What problem does this paper attempt to address?