A Chinese Text Classification Method Using Implied Sub-class Information and Rough Set

JIN Kai-Min,MIAO Duo-Qian,DUAN Qi-Guo
DOI: https://doi.org/10.3969/j.issn.1002-137X.2008.02.041
2008-01-01
Computer Science
Abstract:Chinese Text Classification is a hot area of Information Retrieval and Web Mining.Existing methods have some defect in the phase of Feature Selection.They ignore the hidden sub-class information.This paper suggests a method to enhance the weight of key words of hidden sub-classes,so that we can discover valuable information of sub-classes,then we use Rough Set to construct classifier.The result of the experiment indicates that this method can effectively improve the recall of text classification,without increase the amount of words need reduction.
What problem does this paper attempt to address?