Email Spam Detection Using Hierarchical Attention Hybrid Deep Learning Method

Sultan Zavrak,Seyhmus Yilmaz

DOI: https://doi.org/10.1016/j.eswa.2023.120977

2022-07-29

Abstract:Email is one of the most widely used ways to communicate, with millions of people and businesses relying on it to communicate and share knowledge and information on a daily basis. Nevertheless, the rise in email users has occurred a dramatic increase in spam emails in recent years. Processing and managing emails properly for individuals and companies are getting increasingly difficult. This article proposes a novel technique for email spam detection that is based on a combination of convolutional neural networks, gated recurrent units, and attention mechanisms. During system training, the network is selectively focused on necessary parts of the email text. The usage of convolution layers to extract more meaningful, abstract, and generalizable features by hierarchical representation is the major contribution of this study. Additionally, this contribution incorporates cross-dataset evaluation, which enables the generation of more independent performance results from the model's training dataset. According to cross-dataset evaluation results, the proposed technique advances the results of the present attention-based techniques by utilizing temporal convolutions, which give us more flexible receptive field sizes are utilized. The suggested technique's findings are compared to those of state-of-the-art models and show that our approach outperforms them.

Computation and Language,Machine Learning,Neural and Evolutionary Computing

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the spam detection problem in emails. With the increase in email users, the number of spam emails has also risen sharply, which has brought management difficulties to individuals and enterprises. Therefore, the paper proposes a new hybrid deep - learning method based on Convolutional Neural Network (CNN), Gated Recurrent Unit (GRU) and attention mechanism, aiming to identify and filter spam more effectively. Specifically, the main contributions of the paper are as follows: 1. **Selective Attention**: During the system training process, the network can selectively focus on the necessary parts of the email text. 2. **Hierarchical Representation**: Use convolutional layers to extract more meaningful, abstract and generalized features, and better capture the structural information of the email text through hierarchical representation. 3. **Cross - dataset Evaluation**: Introduce cross - dataset evaluation to generate more independent performance results and avoid over - fitting specific datasets. 4. **Temporal Convolution**: Utilize temporal convolutions to provide a more flexible receptive field size, thereby improving the flexibility and performance of the model. By comparing with the existing state - of - the - art methods, the paper shows that the proposed method is superior to the existing techniques in multiple metrics, especially when dealing with long - sequence data.

Email Spam Detection Using Hierarchical Attention Hybrid Deep Learning Method

Email Spam Detection using Deep Learning Approach

Email spam detection by deep learning models using novel feature selection technique and BERT

Detection of spam reviews through a hierarchical attention architecture with N-gram CNN and Bi-LSTM

A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail

Phishing email detection using deep learning algorithms

Attention Mechanism and Support Vector Machine for Image-Based E-Mail Spam Filtering

Analysis of e-Mail Spam Detection Using a Novel Machine Learning-Based Hybrid Bagging Technique

Phishing Email Detection Model Using Deep Learning

Application of Natural Language Processing and Machine Learning Boosted with Swarm Intelligence for Spam Email Filtering

Deep Learning Framework for Cyber Threat Situational Awareness Based on Email and URL Data Analysis

Improving malicious email detection through novel designated deep-learning architectures utilizing entire email

An Optimized Approach for Detection and Classification of Spam Email's Using Ensemble Methods

Content-based Spam Email Detection Using N-gram Machine Learning Approach

Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text

DeepCapture: Image Spam Detection Using Deep Learning and Data Augmentation

Semantic Graph Based Convolutional Neural Network for Spam e-mail Classification in Cybercrime Applications

Effective Email Spam Detection System using Extreme Gradient Boosting

A parallel hybrid approach integrating clonal selection with artificial bee colony for logistic regression in spam email detection

Boosting Social Spam Detection Via Attention Mechanisms on Twitter

A Two-Stage Spam Email Filtering Method Based on Naive Bayes and Hierarchical Clustering