Building a Hierarchically Aligned Chinese-English Parallel Treebank.

Dun Deng, Nianwen Xue
2014-01-01
Abstract: We construct a hierarchically aligned Chinese-English parallel treebank by manually performing word alignment and phrase alignment simultaneously on parallel phrase-structure parse trees. The main innovation of our approach is that we leave words without a translation counterpart (mostly language-particular function words) unaligned at the word level, and instead locate and align the appropriate phrases that encapsulate them. In doing so, we harmonize word-level and phrase-level alignments. We show that this type of annotation can be performed with high inter-annotator consistency and has both linguistic and engineering potential.
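The idea of leaving a function word unaligned and aligning the phrase that encapsulates it can be illustrated with a small sketch. The treebank itself was annotated by hand, so this is not the authors' procedure; the `Node` class, the `enclosing_phrase` helper, and the toy sentence are all hypothetical and assume each word occurs only once in the tree.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Node:
    """A parse-tree node; `word` is set only on leaves (hypothetical structure)."""
    label: str
    children: List["Node"] = field(default_factory=list)
    word: Optional[str] = None

    def leaves(self) -> List[str]:
        """Return the words covered by this node, left to right."""
        if self.word is not None:
            return [self.word]
        out: List[str] = []
        for child in self.children:
            out.extend(child.leaves())
        return out


def enclosing_phrase(root: Node, target: str, aligned: Set[str]) -> Optional[Node]:
    """Find the lowest phrase covering `target` and at least one aligned word."""
    best: Optional[Node] = None

    def visit(node: Node) -> None:
        nonlocal best
        if node.word is not None:
            return  # leaves are words, not phrases
        words = node.leaves()
        if target in words and any(w in aligned for w in words):
            best = node  # a deeper qualifying phrase overwrites this one
            for child in node.children:
                visit(child)

    visit(root)
    return best


# Toy English tree: (S (NP (DT the) (NN book)) (VP (VBZ sells)))
tree = Node("S", [
    Node("NP", [Node("DT", word="the"), Node("NN", word="book")]),
    Node("VP", [Node("VBZ", word="sells")]),
])

# Assumed word-level alignment: "the" has no Chinese counterpart and stays unaligned.
aligned_words = {"book", "sells"}

phrase = enclosing_phrase(tree, "the", aligned_words)
print(phrase.label, phrase.leaves())
```

Here the determiner stays unaligned at the word level, and the noun phrase that contains it becomes the unit to align with the corresponding Chinese phrase.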