Active Learning for Spam Email Classification

Zheng Chen, Ruiwen Tao, Xiaoyang Wu, Zhimin Wei, Xiao Luo
DOI: https://doi.org/10.1145/3377713.3377789
2019-01-01
Abstract: Deep learning has yielded state-of-the-art performance on text classification tasks. In this paper, a new neural network based on the Long Short-Term Memory (LSTM) model is applied to classify spam emails. Classifying spam emails with deep learning requires large amounts of labeled data. To address this, active learning is used to reduce labeling cost and increase model adaptability. The new model is found to perform better than standard CNNs and RNNs on the email classification task, and active learning methods can match state-of-the-art performance with just 10% of the labeled data.
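
The abstract does not say which query strategy the authors use, so the following is only a minimal sketch of one common pool-based approach, entropy-based uncertainty sampling, to illustrate how active learning can cut labeling cost. The function names, the labeling budget, and the example probabilities are all hypothetical and are not taken from the paper.

```python
import math


def binary_entropy(p):
    """Entropy (in bits) of a Bernoulli(p) prediction; it peaks at p = 0.5."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def select_queries(spam_probs, budget):
    """Return indices of the `budget` most uncertain unlabeled emails.

    `spam_probs` holds the current model's P(spam) for each email in the
    unlabeled pool. Only the selected emails are sent to a human annotator.
    """
    ranked = sorted(
        range(len(spam_probs)),
        key=lambda i: binary_entropy(spam_probs[i]),
        reverse=True,
    )
    return ranked[:budget]


# Hypothetical model outputs for five unlabeled emails (illustrative values).
pool_probs = [0.98, 0.52, 0.03, 0.40, 0.85]
queries = select_queries(pool_probs, budget=2)
print(queries)  # [1, 3]: the two predictions closest to 0.5
```

In a full loop, the selected emails would be labeled, added to the training set, and the classifier retrained before the next round of selection. Confident predictions such as 0.98 or 0.03 are never sent for labeling, which is where the savings in annotation effort come from.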