Text categorization method based on improved KNN algorithm

Wang Aiping, Xu Xiaoyan, Guo Weiwei, Li Fanghua
DOI: https://doi.org/10.3969/j.issn.1674-7720.2011.18.004
2011-01-01
Abstract: This paper introduces two classification methods, the central vector algorithm and the KNN algorithm. To address a shortcoming in how the KNN method computes text similarity, it proposes an improved scheme that incorporates the idea of the central vector classification method. Finally, an empirical study applies the improved KNN algorithm, the central vector algorithm, and the traditional KNN algorithm to the categorization of Chinese text. The experimental results show that the improved KNN algorithm categorizes Chinese text better than either the central vector algorithm or the traditional KNN algorithm, which verifies the validity and feasibility of the improved KNN algorithm.
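For readers unfamiliar with the two baseline methods named in the abstract, the following is a minimal sketch of a central vector (centroid) classifier and a traditional KNN classifier. It is not the paper's improved algorithm, whose details the abstract does not give. The cosine similarity measure, the similarity-weighted vote, the sparse dictionary representation, the function names, and the toy `TRAIN` data are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def class_centroids(train):
    """Average the vectors of each class into one central vector per class."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = Counter()
    for vec, label in train:
        counts[label] += 1
        for term, weight in vec.items():
            sums[label][term] += weight
    return {c: {t: w / counts[c] for t, w in s.items()} for c, s in sums.items()}

def centroid_classify(doc, centroids):
    """Central vector method: pick the class whose centroid is most similar."""
    return max(centroids, key=lambda c: cosine(doc, centroids[c]))

def knn_classify(doc, train, k=3):
    """Traditional KNN: similarity-weighted vote among the k nearest documents."""
    nearest = sorted(train, key=lambda item: cosine(doc, item[0]), reverse=True)[:k]
    votes = defaultdict(float)
    for vec, label in nearest:
        votes[label] += cosine(doc, vec)
    return max(votes, key=votes.get)

# Hypothetical toy corpus: (term-weight vector, class label) pairs.
TRAIN = [
    ({"ball": 2.0, "team": 1.0}, "sports"),
    ({"ball": 1.0, "goal": 2.0}, "sports"),
    ({"stock": 2.0, "market": 1.0}, "finance"),
    ({"stock": 1.0, "bank": 2.0}, "finance"),
]

if __name__ == "__main__":
    query = {"ball": 1.0, "goal": 1.0}
    print(centroid_classify(query, class_centroids(TRAIN)))
    print(knn_classify(query, TRAIN, k=3))
```

The centroid method compares a document against one vector per class, so it is cheap but coarse. KNN compares against every training document, so it is finer grained but depends entirely on the quality of the pairwise similarity, which is the weakness the abstract says the improved scheme targets.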