Active Learning for Online Spam Filtering

Wuying Liu,Ting Wang
DOI: https://doi.org/10.1007/978-3-540-68636-1_63
2008-01-01
Abstract:Spam filtering is defined as a task trying to label emails with spam or ham in an online situation. The online feature requires the spam filter has a strong timely generalization and has a high processing speed. Machine learning can be employed to fulfill the two requirements. In this paper, we propose a SVMEL (SVM Ensemble Learning) method to combine five simple filters for higher accuracy and an active learning method to choose training emails for less training time. The experiments results show the filter applying active learning method can reduce requirements of labeled training emails and reach steady-state performance more quickly.
What problem does this paper attempt to address?