Noun Phrase Alignment in Chinese-English Bilingual Corpora

Dong Liu
2003-01-01
Abstract:In this paper, a method is proposed to align bilingual noun phrases automatically in sentencealigned ChineseEnglish bilingual corpus. The characteristic of our method is to deal with highfrequency noun phrases and lowfrequency noun phrases separately without recognizing Chinese noun phrase accurately. Highfrequency noun phrases in English corpus are aligned to those in Chinese corpus using an iterative reevaluation algorithm according to the cooccurrence between English phrases and Chinese words in bilingual corpora; Lowfrequency noun phrases are aligned using the manual rules and Dice coefficient which is based on EnglishChinese dictionary. This method can take into account the alignment information on the whole, and acquire the result with high coverage rate.
What problem does this paper attempt to address?