Chinese sentence segmentation based on SVM method

MA Jin-shan, LIU Ting, Sheng Li
2009-01-01
Abstract: Long sentences degrade the performance of syntactic parsing. To address this problem, this paper presents a method for identifying sentence segments with an SVM classifier. In this method, a sentence is first divided into segments, each of which is assigned a label indicating its syntactic type. The sentence is then parsed on the basis of these segments. Finally, all the segments are linked together through dependency relations, which completes the dependency tree for the whole sentence. Experiments show that identifying segments reduces the complexity of parsing and improves the accuracy of Chinese dependency parsing.
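The three steps in the abstract (segment and label, parse each segment, link the segments) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no features, label set, or parser details, so the punctuation-based boundary rule, the `toy_label` stand-in for the SVM classifier, the labels "clause" and "phrase", and the rule that the last word of a segment is its head are all assumptions made only to keep the example runnable.

```python
from typing import List, Tuple

# Assumption: a segment ends at one of these punctuation tokens.
BOUNDARY_TOKENS = {",", ";", ":", "，", "；", "："}

Arc = Tuple[int, int]  # (head index, dependent index); head -1 marks the root


def split_segments(tokens: List[str]) -> List[List[int]]:
    """Group token indices into segments that end at a boundary token."""
    segments: List[List[int]] = []
    current: List[int] = []
    for i, tok in enumerate(tokens):
        current.append(i)
        if tok in BOUNDARY_TOKENS:
            segments.append(current)
            current = []
    if current:
        segments.append(current)
    return segments


def toy_label(tokens: List[str], segment: List[int]) -> str:
    """Hypothetical stand-in for the SVM classifier (labels are invented)."""
    words = [i for i in segment if tokens[i] not in BOUNDARY_TOKENS]
    return "clause" if len(words) >= 2 else "phrase"


def parse_segment(tokens: List[str], segment: List[int]) -> Tuple[int, List[Arc]]:
    """Toy parser: the last non-boundary token heads the whole segment."""
    words = [i for i in segment if tokens[i] not in BOUNDARY_TOKENS] or segment
    head = words[-1]
    return head, [(head, i) for i in segment if i != head]


def parse_sentence(tokens: List[str]) -> Tuple[List[str], List[Arc]]:
    """Segment, label, parse each segment, then link the segment heads."""
    segments = split_segments(tokens)
    if not segments:
        return [], []
    labels = [toy_label(tokens, seg) for seg in segments]
    heads: List[int] = []
    arcs: List[Arc] = []
    for seg in segments:
        head, seg_arcs = parse_segment(tokens, seg)
        heads.append(head)
        arcs.extend(seg_arcs)
    # Assumption: the head of the last segment is the root of the sentence,
    # and every other segment head depends on it.
    root = heads[-1]
    arcs.extend((root, h) for h in heads[:-1])
    arcs.append((-1, root))
    return labels, arcs


if __name__ == "__main__":
    labels, arcs = parse_sentence(["我", "来", "，", "他", "走"])
    print(labels)
    print(sorted(arcs, key=lambda arc: arc[1]))
```

Because each segment is parsed on its own, the parser only ever sees a short token span, and the linking step adds one arc per segment. This is the sense in which segmentation reduces the work of parsing a long sentence; a real system would replace both placeholders with trained models.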