An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach

Mohammad Amaz Uddin,Iqbal H. Sarker

2024-02-21

Abstract:Phishing email is a serious cyber threat that tries to deceive users by sending false emails with the intention of stealing confidential information or causing financial harm. Attackers, often posing as trustworthy entities, exploit technological advancements and sophistication to make detection and prevention of phishing more challenging. Despite extensive academic research, phishing detection remains an ongoing and formidable challenge in the cybersecurity landscape. Large Language Models (LLMs) and Masked Language Models (MLMs) possess immense potential to offer innovative solutions to address long-standing challenges. In this research paper, we present an optimized, fine-tuned transformer-based DistilBERT model designed for the detection of phishing emails. In the detection process, we work with a phishing email dataset and utilize the preprocessing techniques to clean and solve the imbalance class issues. Through our experiments, we found that our model effectively achieves high accuracy, demonstrating its capability to perform well. Finally, we demonstrate our fine-tuned model using Explainable-AI (XAI) techniques such as Local Interpretable Model-Agnostic Explanations (LIME) and Transformer Interpret to explain how our model makes predictions in the context of text classification for phishing emails.

Artificial Intelligence,Machine Learning,Cryptography and Security

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is the challenges in **phishing email detection**. Specifically, the paper focuses on the following aspects: 1. **Severity of phishing emails**: Phishing emails are a serious cyber - threat. Attackers attempt to steal users' confidential information or cause financial losses by sending fake emails. With the progress and increasing complexity of technology, the detection and prevention of phishing emails have become more difficult. 2. **Limitations of existing methods**: Although much research has been devoted to phishing email detection, existing methods still have some shortcomings, especially in dealing with unbalanced data sets and explaining model predictions. 3. **Taking advantage of large - language models**: Large - language models (LLMs) and masked - language models (MLMs) have great potential and can provide innovative solutions to long - standing challenges. The paper proposes an optimized and fine - tuned DistilBERT model for phishing email detection. 4. **Model interpretability**: To improve the transparency and credibility of the model, the paper uses two explainable artificial intelligence (XAI) techniques - Local Interpretable Model - agnostic Explanations (LIME) and Transformer Interpret - to explain how the model makes predictions. ### Main contributions of the paper - **Proposes a Transformer - based phishing email detection model**, which is a fine - tuned version of DistilBERT. - **Utilizes two different XAI techniques, LIME and Transformer Interpret**, explores the interpretability of the fine - tuned model, and visualizes the percentage scores of these techniques in prediction. - **Demonstrates the performance of the model on unbalanced and balanced data sets**. Through these contributions, the paper not only improves the accuracy of phishing email detection but also enhances the transparency and interpretability of the model, which is crucial for trust and adoption in practical applications.

An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach

An Improved Transformer-based Model for Detecting Phishing, Spam, and Ham: A Large Language Model Approach

Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts

SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection

ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis

A Large-Scale Pretrained Deep Model for Phishing URL Detection

A Lightweight Multi-View Learning Approach for Phishing Attack Detection Using Transformer with Mixture of Experts

Devising and Detecting Phishing: Large Language Models vs. Smaller Human Models

Next-Generation Phishing: How LLM Agents Empower Cyber Attackers

Multimodal Large Language Models for Phishing Webpage Detection and Identification

ChatSpamDetector: Leveraging Large Language Models for Effective Phishing Email Detection

Devising and Detecting Phishing Emails Using Large Language Models

Phishing Email Detection Model Using Deep Learning

Phishing Website Detection through Multi-Model Analysis of HTML Content

Automatic Phishing Website Detection and Prevention Model Using Transformer Deep Belief Network

An Innovative Information Theory-based Approach to Tackle and Enhance The Transparency in Phishing Detection

Large Language Model Lateral Spear Phishing: A Comparative Study in Large-Scale Organizational Settings

Novel Interpretable and Robust Web-based AI Platform for Phishing Email Detection

ChatPhishDetector: Detecting Phishing Sites Using Large Language Models

A Transformer-based Model to Detect Phishing URLs