Apply Stacked Auto-Encoder to Spam Detection.

Guyue Mi,Yang Gao,Ying Tan
DOI: https://doi.org/10.1007/978-3-319-20472-7_1
2015-01-01
Abstract:In this paper, we apply Stacked Auto-encoder, one of the main types of deep networks, hot topic of machine learning recently, to spam detection and comprehensively compare its performance with other prevalent machine learning techniques those are commonly used in spam filtering. Experiments were conducted on five benchmark corpora, namely PU1, PU2, PU3, PUA and Enron-Spam. Accuracy and (F_1) measure are selected as the main criteria in analyzing and discussing the results. Experimental results demonstrate that Stacked Auto-encoder performs better than Naive Bayes, Support Vector Machine, Decision Tree, Boosting, Random Forest and traditional Artificial Neural Network both in accuracy and (F_1) measure, which endows deep learning with application in spam filtering in the real world.
What problem does this paper attempt to address?