Term Weighting Algorithm in Text Categorization Based on VSM

SU Li-hua, ZHU Zhang-hua, BAI Wen-hua
DOI: https://doi.org/10.3969/j.issn.1009-3044.2010.33.059
2010-01-01
Abstract: This paper discusses the application of the Vector Space Model (VSM) to text categorization and analyzes the traditional term-weighting algorithm, TF-IDF. TF-IDF considers only two factors: term frequency (TF) and inverse document frequency (IDF). The paper proposes introducing an inter-class factor into term weighting. Experimental results show that the improved algorithm, which combines the inter-class factor with TF-IDF, outperforms the traditional methods in classification precision.
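
As a rough illustration of the idea described in the abstract, the sketch below computes plain TF-IDF weights and then scales them by a per-term inter-class factor. The abstract does not give the paper's formula, so `inter_class_factor` is a hypothetical stand-in (one plus the gap between a term's highest and mean per-class document rate), and the function names, the toy corpus, and the multiplicative combination are all assumptions made here for illustration only.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Plain TF-IDF: docs is a list of token lists; returns one {term: weight} dict per doc."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for d in docs for t in set(d))
    weights = []
    for d in docs:
        tf = Counter(d)
        weights.append({t: (c / len(d)) * math.log(n / df[t]) for t, c in tf.items()})
    return weights


def inter_class_factor(term, docs, labels):
    """HYPOTHETICAL factor (not the paper's definition).

    For each class, take the fraction of its documents containing the term,
    then return 1 + (max rate - mean rate). Terms concentrated in one class
    score higher; terms spread evenly across classes score exactly 1.
    """
    rates = []
    for c in sorted(set(labels)):
        in_class = [d for d, lab in zip(docs, labels) if lab == c]
        rates.append(sum(term in d for d in in_class) / len(in_class))
    return 1 + (max(rates) - sum(rates) / len(rates))


def weighted_tf_idf(docs, labels):
    """Assumed combination: multiply each TF-IDF weight by the term's inter-class factor."""
    return [
        {t: w * inter_class_factor(t, docs, labels) for t, w in doc_w.items()}
        for doc_w in tf_idf(docs)
    ]


# Toy corpus, invented for this example.
docs = [
    ["ball", "goal", "team"],
    ["ball", "team", "win"],
    ["stock", "market", "team"],
    ["stock", "price", "market"],
]
labels = ["sport", "sport", "finance", "finance"]
improved = weighted_tf_idf(docs, labels)
```

In this toy corpus, "ball" appears only in the sport class and so is boosted more than "team", which appears in both classes. Whether the paper combines the factors multiplicatively, or defines the inter-class factor in a similar way, cannot be determined from the abstract.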