Analysis of Rapid Chinese Text Chunking Based on SVM

KONG Ling-peng, ZHANG Chen, ZHANG Quan
DOI: https://doi.org/10.3969/j.issn.1004-373x.2012.21.029
2012-01-01
Abstract: In statistical learning theory, the support vector machine (SVM), which is based on structural risk minimization, performs better as the number of samples increases. For natural language processing (NLP) models, the samples have to cover a wide range of language phenomena. However, as the number of training samples grows, the time cost of training and classification increases rapidly. This paper approaches text chunking from the viewpoint of text segmentation. The system builds separate two-class models corresponding to the word tags, discovers rules from the samples to speed up analysis, and determines the chunk type through the inner tag features. Experiments show improvements in both analysis speed and accuracy.
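The abstract does not give the form of the rules, so the following is only a minimal sketch of one plausible reading: tag contexts that are unambiguous in the training samples become lookup rules, and only the remaining contexts are passed to the slower two-class classifier. The names mine_rules and predict_boundary, the min_count threshold, the (previous tag, current tag) context, and the tag labels are all assumptions made for illustration, not the authors' method. The fallback argument stands in for a trained two-class SVM.

```python
from collections import Counter, defaultdict


def mine_rules(samples, min_count=2):
    """Collect contexts whose label never varies in the training samples.

    samples: iterable of ((prev_tag, tag), label) pairs, where label 1 marks
    a chunk boundary and label 0 marks no boundary (hypothetical encoding).
    """
    counts = defaultdict(Counter)
    for context, label in samples:
        counts[context][label] += 1
    rules = {}
    for context, labels in counts.items():
        # Keep a rule only if the context is unambiguous and seen often enough.
        if len(labels) == 1 and sum(labels.values()) >= min_count:
            rules[context] = next(iter(labels))
    return rules


def predict_boundary(context, rules, fallback):
    """Answer from a mined rule when one applies, else call the classifier."""
    if context in rules:
        return rules[context]
    return fallback(context)


# Toy samples with hypothetical tags; ("n", "n") is ambiguous, so no rule.
samples = [
    (("d", "n"), 0), (("d", "n"), 0),
    (("n", "v"), 1), (("n", "v"), 1),
    (("n", "n"), 0), (("n", "n"), 1),
]
rules = mine_rules(samples)
```

In this sketch the speed-up comes from skipping the classifier whenever a rule matches, so its size depends on how many contexts in real data are unambiguous; the abstract reports a speed improvement but gives no figures.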