Document classification approach by rough-set-based corner classification neural network

Weifeng Zhang,Baowen Xu,Zifeng Cui,Junling Xu
DOI: https://doi.org/10.3969/j.issn.1003-7985.2006.03.033
2006-01-01
Abstract:A rough set based corner classification neural network, the Rough-CC4, is presented to solve document classification problems such as document representation of different document sizes,document feature selection and document feature encoding.In the Rough-CC4,the documents are described by the equivalent classes of the approximate words.By this method,the dimensions representing the documents can be reduced,which can solve the precision problems caused by the different document sizes and also blur the differences caused by the approximate words.In the Rough-CC4,a binary encoding method is introduced,through which the importance of documents relative to each equivalent class is encoded.By this encoding method,the precision of the Rough-CC4 is improved greatly and the space complexity of the Rough-CC4 is reduced.The Rough-CC4 can be used in automatic classification of documents.
What problem does this paper attempt to address?