Improved Algorithm Based on Tfidf in Text Classification

Hao Jiang,Wenqiang Li
DOI: https://doi.org/10.4028/www.scientific.net/amr.403-408.1791
2011-01-01
Advanced Materials Research
Abstract:Traditional feature weighting algorithm TFIDF doesn't take some other factors which impact the feature weight into consideration, so this paper discusses the factors in details and proposes a new feature weighting algorithm called NTFIDF combined with these factors and TFIDF. Experiment on the KNN classifier shows that NTFIDF is better than TFIDF in text classification.
What problem does this paper attempt to address?