Chinese News Text Classification Based on Machine Learning Algorithm

Fang Miao,Pu Zhang,Libiao Jin,Hongda Wu
DOI: https://doi.org/10.1109/ihmsc.2018.10117
2018-08-01
Abstract:Text classification is the key technology for mining and organizing text information, which is the process of determining the text types automatically according to the content. Based on machine learning algorithm, text classification system includes four processes, namely text pretreatment, text representation, classifier training and classification. In this paper, a Chinese news text classification system model is designed. And in the classifier training part, we separately chose and compared K-nearest Neighbor, Naive Bayes, and Support Vector Machine as our classification algorithm. Then, we tested and analyzed these classifiers with each other and finally got a conclusion. The experimental conclusion shows that the Chinese news text classification system can get satisfied results based on the machine learning algorithm.
What problem does this paper attempt to address?