E-mail Classification Based on Concept Vector Space Model

曾超,吕钊,顾君忠
DOI: https://doi.org/10.3724/sp.j.1087.2008.03248
2009-01-01
Journal of Computer Applications
Abstract:A new approach of e-mail classification based on the concept vector space model was proposed. In this approach, the eigenvector of the e-mail was extracted during training process by replacing terms with synonymy sets in WordNet and considering hypernymy-hyponymy relation between synonymy sets. Then, TF IWF IWF method was used to revise the weight of the concept vector. In the end, the type of e-mail was determined using the simple vector classification method. Compared with the term-based VSM approach, the results show that this approach can improve the accuracy of e-mail classification especially when the size of training set is small.
What problem does this paper attempt to address?