Based on the simulated annealing algorithm for improved KNN categorization algorithm

Deng Zhen,Bao Hong
DOI: https://doi.org/10.3969/j.issn.1001-4160.2010.03.005
2010-01-01
Abstract:The KNN is a simple,valid and non-parameter method applied to text categorization.The traditional KNN has a fatal defect that time of similarity computing is huge.The practicality will be lost when the KNN is applied to text categorization with high dimension and huge samples.On the one hand,this paper analyzed the advantages and defects' of KNN algorithm,Uses the method of feature combination and feature word extracted to improve the KNN's feature word extracted.On the other hand,it also deeply research the thought of simulated annealing algorithm,uses the ways of typical optimize group and simulated annealing algorithm to advance the speed of KNN's text categorization.By combining the two improvements effectively,performed a based on the Simulated Annealing(S A) algorithm for Improved KNN Categorization Algorithm.The result of experiment demonstrates that the improved algorithm compared to the well-known algorithm KNN at the cost of time and the accuracy of classification have improved.
What problem does this paper attempt to address?