Research and Design of a Spam Filtering System Based on Statistical Learning Theory

TANG Wei, CHENG Jia-xing, JI Xia
DOI: https://doi.org/10.3969/j.issn.1673-629X.2008.12.068
2008-01-01
Abstract: Classification is one of the most important research fields in data mining and machine learning. In recent years, automatic text categorization has been studied extensively and has progressed rapidly. This paper proposes an SVM-based text categorization method grounded in statistical learning theory and designs a corresponding spam email filtering system. A comparison with the naive Bayes classifier verifies the effectiveness of the system. Finally, some directions for future research are given.