A simple and efficient classifying algorithm

Jianhui Wang, Shuigeng Zhou, H. U. Yun-Fa
DOI: https://doi.org/10.1109/ICMLC.2003.1264441
2003-01-01
Abstract: Most current classification methods are based on the vector space model (VSM), of which k-nearest neighbors (kNN) is a widely used example. However, most of them are computationally expensive, cannot be used when a large number of specimens must be classified, and scale poorly because the classifier must be rebuilt whenever the training corpus is extended. This paper puts forward two new notions, mutual dependence and equivalent radius, and proposes a new classification algorithm based on them, SECTILE. SECTILE is then applied to classifying Chinese documents and compared with the kNN and CCC methods. The experimental results suggest that SECTILE outperforms kNN and CCC, can be used online to classify a large number of specimens, and scales well, while the precision and recall of classification are maintained.