Advancing Phishing Email Detection: A Comparative Study of Deep Learning Models

Najwa Altwaijry, Isra Al-Turaiki, Reem Alotaibi, Fatimah Alakeel
DOI: https://doi.org/10.3390/s24072077
IF: 3.9
2024-03-25
Sensors
Abstract: Phishing is one of the most dangerous attacks targeting individuals, organizations, and nations. Although many traditional methods for email phishing detection exist, there is a need to improve accuracy and reduce false-positive rates. To address these challenges, our work investigates one-dimensional CNN-based models (1D-CNNPD) for detecting phishing emails. To improve performance further, we augment the base 1D-CNNPD model with recurrent layers, namely LSTM, Bi-LSTM, GRU, and Bi-GRU, and experiment with the four resulting models. Two benchmark datasets were used to evaluate the performance of our models: Phishing Corpus and Spam Assassin. Our results indicate that, in general, the augmentations improve the performance of the 1D-CNNPD base model. Specifically, the 1D-CNNPD with Bi-GRU yields the best results. Overall, the performance of our models is comparable to the state of the art in CNN-based phishing email detection. The Advanced 1D-CNNPD with Leaky ReLU and Bi-GRU achieved 100% precision, 99.68% accuracy, an F1 score of 99.66%, and a recall of 99.32%. We observe that increasing model depth typically leads to an initial performance improvement, followed by a decline. In conclusion, this study highlights the effectiveness of augmented 1D-CNNPD models in detecting phishing emails with improved accuracy. The reported performance values indicate the potential of these models to advance cybersecurity solutions that combat email phishing attacks.
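The reported metrics can be cross-checked against one another, since the F1 score is the harmonic mean of precision and recall. The following minimal sketch uses only the precision and recall values quoted in the abstract; the function name `f1_score` is a hypothetical helper and not part of the authors' code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # convention: F1 is 0 when both inputs are 0
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract for the Advanced 1D-CNNPD with Leaky ReLU and Bi-GRU.
precision = 1.0    # 100% precision
recall = 0.9932    # 99.32% recall

f1 = f1_score(precision, recall)
print(f"F1 = {f1:.2%}")  # prints "F1 = 99.66%"
```

The result agrees with the reported F1 score of 99.66%. Because precision is TP / (TP + FP), a precision of 100% also means the model produced no false positives on the evaluated test set, which is the error type the abstract sets out to reduce.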
engineering, electrical & electronic; chemistry, analytical; instruments & instrumentation
What problem does this paper attempt to address?
The paper aims to improve the accuracy of phishing email detection and to reduce the false-positive rate. Many traditional methods for detecting phishing emails exist, but they are not effective enough against increasingly sophisticated phishing attacks. The authors therefore study one-dimensional convolutional neural networks (1D-CNN), both on their own and combined with recurrent layers (LSTM, Bi-LSTM, GRU, and Bi-GRU), to enhance detection performance. Specifically, the goals of the paper include:

1. **Evaluating 1D-CNN models of different depths**: Investigate the impact of model depth on performance to find the simplest model that performs best.
2. **Enhancing 1D-CNN models**: Improve the performance of 1D-CNN models by adding recurrent layers such as LSTM and GRU.
3. **Comparing the performance of different models**: Evaluate the models on two benchmark datasets (Phishing Corpus and Spam Assassin) and compare them with existing methods.

Through these studies, the authors hope to validate the potential of deep learning models in phishing email detection and to explore the relationship between model complexity and performance. Ultimately, they aim to develop a lightweight and efficient model that detects phishing emails effectively, thereby improving cybersecurity.
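To make the building block concrete, the sketch below shows what a single 1D convolutional filter does to a sequence of token vectors: it slides over the sequence, applies a Leaky ReLU, and keeps the strongest response. This is an illustration only, not the authors' 1D-CNNPD architecture. The toy input, the kernel weights, the `alpha` value, the use of global max pooling, and all function names are assumptions chosen for readability.

```python
from typing import List, Sequence


def conv1d(seq: Sequence[Sequence[float]],
           kernel: Sequence[Sequence[float]],
           bias: float = 0.0) -> List[float]:
    """'Valid' 1D convolution (cross-correlation, as in most deep learning libraries).

    seq:    T token vectors, each of dimension D
    kernel: K weight vectors, each of dimension D (one filter of width K)
    Returns T - K + 1 activations, one per window position.
    """
    k = len(kernel)
    out = []
    for t in range(len(seq) - k + 1):
        s = bias
        for i in range(k):
            s += sum(x * w for x, w in zip(seq[t + i], kernel[i]))
        out.append(s)
    return out


def leaky_relu(x: float, alpha: float = 0.1) -> float:
    """Pass positive values through; scale negative values by alpha."""
    return x if x > 0 else alpha * x


# Toy "email": 4 tokens with 2-dimensional embeddings (made-up numbers).
tokens = [[1, 0], [0, 1], [0, 1], [1, 1]]
# One filter spanning 2 consecutive tokens (made-up weights).
kernel = [[1.0, -1.0], [0.5, 0.5]]

feature_map = conv1d(tokens, kernel)               # [1.5, -0.5, 0.0]
activated = [leaky_relu(v) for v in feature_map]   # negative value shrinks to about -0.05
pooled = max(activated)                            # global max pooling -> 1.5
print(feature_map, pooled)
```

A deeper model stacks several such layers, each with many filters. In the augmented variants described above, the activated feature map would be passed to a recurrent layer such as a Bi-GRU rather than being pooled directly.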