Chinese Complex Long Sentences Processing Method for Chinese-Japanese Machine Translation

Dapeng Yin,Peilin Jiang,Fuji Ren,Shingo Kuroiwa
DOI: https://doi.org/10.1109/nlpke.2007.4368029
2007-01-01
Abstract:The research on machine translation has been lasting for many years, and now this research field is increasingly a thoroughly refined. In practical machine translation system, the processing of a simple and short Chinese sentence has somewhat good results. However, but the complex long Chinese sentence translation still has difficulties. In this paper a new hierarchical approach processing for Chinese complex long sentence through analysis of Chinese punctuation, conjunctive words and syntax function is proposed. The method synthetically uses semantic characteristic of source Chinese sentence, which includes grammatical features, the length of the source sentence, punctuation and functional words. First phase is conjunctive words for simplified segmentation, and then the syntax analysis, in order to process complex long sentence by the multi-hierarchical approach. A long sentence is divided into several parts; every part gets the correct translation of the result separately, and then is combined by the comprehensive approach to gain the complex long sentence translation result. Experiments show that our approach can significantly reduce the time consumption and numbers of ambiguity, and also improve the accuracy and readability when parsing Chinese complex long sentence.
What problem does this paper attempt to address?