Graph-based KNN Text Classification

Zonghu Wang,Zhijing Liu
DOI: https://doi.org/10.1109/fskd.2010.5569866
2010-01-01
Abstract:Vector space model is used in most text categorization methods without considering the important information such as the order and co- occurrence of words within the text. In this paper we describe a novel approach of text classification using graph-based KNN. We reduce the number of features dimensions by a combined feature selection method. Then we present an improved graph-based text representation model and describe a novel graph-based KNN algorithm to predict the category of the texts in the testing set. The result shows that our approach can outperform traditional VSM-based KNN methods in terms of both accuracy and cost time.
What problem does this paper attempt to address?