Detecting ham and spam emails using feature union and supervised machine learning models

Furqan Rustam,Najia Saher,Arif Mehmood,Ernesto Lee,Sandrilla Washington,Imran Ashraf
DOI: https://doi.org/10.1007/s11042-023-14814-2
IF: 2.577
2023-03-10
Multimedia Tools and Applications
Abstract:Spam emails are cyber nuisances that cause serious security threats including personal and financial information. Although several spam detection approaches exist, detecting new strains of spam messages is challenging that requires a reliable and efficient intelligent spam email detection approach. This study utilizes features from the text of emails to determine whether it is spam or normal. Multiple features are combined to obtain a higher accuracy for spam email detection. Experiments involve machine learning and deep learning models and the influence of data resampling is also investigated. Performance analysis is done using F1 score, recall, precision, and accuracy, as well as comparison with state-of-the-art approaches. Random forest and logistic regression achieve the highest accuracy scores 0.991 and 0.990, respectively which is much better than existing models.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?