Improved KNN Text Categorization

ZHONG Jiang, LIU Ronghui
DOI: https://doi.org/10.3778/j.issn.1002-8331.2012.02.041
2012-01-01
Computer Engineering and Applications Journal
Abstract: In text categorization, high feature dimensionality and an imbalanced distribution of sample data degrade classification results. To address this problem, this paper proposes an improved KNN method. Latent semantic analysis is used to reduce the dimensionality of the text feature matrix, and an improved density-based KNN method is then used to perform the categorization. The experimental results show that the proposed method effectively improves text categorization precision.
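
The abstract does not spell out how density enters the KNN step, so the following is only a minimal sketch of one possible reading, not the authors' algorithm. It assumes the document vectors have already been reduced (for example by latent semantic analysis) and that each neighbour's vote is down-weighted by its local density, so that samples from a densely populated majority class count for less. The function names, the parameters `k` and `m`, and the weighting rule are all hypothetical.

```python
import math
from collections import defaultdict


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def local_density(points, m=2):
    """Assumed density measure: inverse of the mean distance from each
    point to its m nearest neighbours in the training set."""
    densities = []
    for i, p in enumerate(points):
        dists = sorted(euclidean(p, q) for j, q in enumerate(points) if j != i)
        count = max(1, min(m, len(dists)))
        mean_dist = sum(dists[:m]) / count
        densities.append(1.0 / (mean_dist + 1e-9))
    return densities


def density_knn_predict(points, labels, query, k=3, m=2):
    """Classify `query` by a vote among its k nearest training points,
    where each vote is weighted by 1 / local density (an assumption)."""
    densities = local_density(points, m)
    nearest = sorted(range(len(points)),
                     key=lambda i: euclidean(points[i], query))[:k]
    votes = defaultdict(float)
    for i in nearest:
        votes[labels[i]] += 1.0 / densities[i]
    return max(votes, key=votes.get)


# Toy imbalanced data: class "A" is a dense cluster, class "B" is sparse.
train_points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (0.05, 0.05),
                (1.0, 1.0), (1.6, 1.6)]
train_labels = ["A", "A", "A", "A", "A", "B", "B"]
prediction = density_knn_predict(train_points, train_labels, (0.6, 0.6))
```

In this toy data, two of the three nearest neighbours of the query belong to the dense class "A", so an unweighted majority vote would favour "A". The density weighting lets the single sparse "B" neighbour outweigh them, which illustrates the kind of imbalance correction the abstract alludes to.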