Email Spam Detection Using Hierarchical Attention Hybrid Deep Learning Method

Sultan Zavrak,Seyhmus Yilmaz
DOI: https://doi.org/10.1016/j.eswa.2023.120977
2022-07-29
Abstract:Email is one of the most widely used ways to communicate, with millions of people and businesses relying on it to communicate and share knowledge and information on a daily basis. Nevertheless, the rise in email users has occurred a dramatic increase in spam emails in recent years. Processing and managing emails properly for individuals and companies are getting increasingly difficult. This article proposes a novel technique for email spam detection that is based on a combination of convolutional neural networks, gated recurrent units, and attention mechanisms. During system training, the network is selectively focused on necessary parts of the email text. The usage of convolution layers to extract more meaningful, abstract, and generalizable features by hierarchical representation is the major contribution of this study. Additionally, this contribution incorporates cross-dataset evaluation, which enables the generation of more independent performance results from the model's training dataset. According to cross-dataset evaluation results, the proposed technique advances the results of the present attention-based techniques by utilizing temporal convolutions, which give us more flexible receptive field sizes are utilized. The suggested technique's findings are compared to those of state-of-the-art models and show that our approach outperforms them.
Computation and Language,Machine Learning,Neural and Evolutionary Computing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the spam detection problem in emails. With the increase in email users, the number of spam emails has also risen sharply, which has brought management difficulties to individuals and enterprises. Therefore, the paper proposes a new hybrid deep - learning method based on Convolutional Neural Network (CNN), Gated Recurrent Unit (GRU) and attention mechanism, aiming to identify and filter spam more effectively. Specifically, the main contributions of the paper are as follows: 1. **Selective Attention**: During the system training process, the network can selectively focus on the necessary parts of the email text. 2. **Hierarchical Representation**: Use convolutional layers to extract more meaningful, abstract and generalized features, and better capture the structural information of the email text through hierarchical representation. 3. **Cross - dataset Evaluation**: Introduce cross - dataset evaluation to generate more independent performance results and avoid over - fitting specific datasets. 4. **Temporal Convolution**: Utilize temporal convolutions to provide a more flexible receptive field size, thereby improving the flexibility and performance of the model. By comparing with the existing state - of - the - art methods, the paper shows that the proposed method is superior to the existing techniques in multiple metrics, especially when dealing with long - sequence data.