RESEARCH ON CLASSICAL AND MODERN CHINESE SENTENCE ALIGNMENT

Ying Liu,Nan Wang
DOI: https://doi.org/10.3969/j.issn.1000-386x.2013.11.036
2013-01-01
Abstract:Sentences alignment for Chinese parallel corpus is studied in the paper .The parallel corpora are the original text ( classical Chinese) and its modern text translation ( modern text ) of Shiji ( Records of the Grand Historian ) written by SiMa Qian in the period of Western Han Dynasty .The log-linear model combines the length feature and sentence alignment mode feature of the sentence with the co -occurrence of Chinese words feature , in this way to align the sentences of the classical Chinese and the modern text of Shiji .Through the experiment it can be demonstrate that the precision and recall rate of sentence alignment reach the highest at 94.4%and 94.3%respectively when taking into account these three features at the same time .
What problem does this paper attempt to address?