Document Classification With Cc4 Neural Network

Eh Chen,Sf Wang,Zy Zhang,Xf Wang
2001-01-01
Abstract:CC4 neural network is a new type of corner classification training algorithm for three-layered feedforward neural networks. On the condition that documents are almost of the same size, CC4 neural network is an efficient document classification algorithm. It would be impractical however to assume that all documents on WWW is of the same size in reality. To solve the problem incurred by the great difference in document sizes, we propose an MDS-NN based data indexing method thus making all documents be mapped to k-dimensional points while their distance information is kept well. We also extend CC4 neural network so that it can accept k-dimensional indexes of documents as its input, then transforms these indexes to binary sequences required by CC4 neural network Our experimental results show that the performance of our method is much better than that of original CC4.
What problem does this paper attempt to address?