Word alignment method and device of bitext

Li Peng,Liu Yang,Xue Ping,Sun Maosong
2013-01-01
Abstract:The invention discloses a word alignment method and a word alignment device of a bitext and belongs to the field of text string processing. The word alignment method comprises the following steps of respectively preprocessing an original text and a translation test of the bitext to be aligned; computing a connection gain added between any one of source language words and object language words; setting the alignment of an initial word to be empty alignment; searching word alignment which is restrained by an inversion transduction grammar by using greedy strategy iteration; and outputting the searched best word alignment meeting the inversion transduction grammar restriction to be served as a final alignment result. The device comprises a preprocessing module, a connection gain computation module, an initial word alignment generation module, a word alignment search module, and a word alignment result output module. The word alignment method and the device of the bitext search the word alignment which is restrained by the inversion transduction grammar by using greedy strategy iteration, and have the effects of increasing the speed of the word alignment and ensuring the quality of good word alignment.
What problem does this paper attempt to address?