Chinese Text Categorization Technology Using BP Neural Network

YANG Xinwu,LI Sen,LIU Chunnian
DOI: https://doi.org/10.3969/j.issn.2095-347x.2008.03.007
2008-01-01
Abstract:This paper has illustrated the description of the Chinese text categorization problem,the key technology and system design,and base on that,this paper explains the method how to use BP artificial network(with momentum) to achieve the goal of automatically classifying Chinese texts into different categories.The method adopts the TF-IDF formula to calculate weight and uses Expected Cross Entropy method as a way of reducing space dimension.Finally,on the TanCorp12 text set,we use macro-average F1 and micro-average F1 as evaluation criterion to test the impact of parameters,such as input node number,training times,on the performance of the classifier.
What problem does this paper attempt to address?