Research on Text Categorization Model Based on Latent Semantic Analysis and HS-SVM

Zhang Yufeng
DOI: https://doi.org/10.16353/j.cnki.1000-7490.2010.07.002
2010-01-01
Abstract:A text categorization model based on Latent Semantic Analysis and Hyper-sphere Support Vector Machine (HS-SVM) is proposed to improve the accuracy and efficiency of text categorization. As the convergence rate of using SVM to categorize the large-scale text is relatively slow,the Hyper-sphere Support Vector Machine is applied to text categorization and the Hyper-sphere Support Vector Machine Classification Learning Algorithm based on incremental learning is applied to training and categorization. Experiments show that the Hyper-sphere Support Vector Machine is an efficient solution to the SVM problem,and has the same accuracy as the SVM in the text categorization applications,but significantly reduces the complexity of the model and the training time.
What problem does this paper attempt to address?