Abstract:Phishing is a cyber-attack that exploits victims' technical ignorance or naivety and commonly involves a Uniform Resources Locator (URL). As a result, it is beneficial to examine URLs before accessing them to spot a phishing assault. Several algorithms based on machine learning have been presented to detect phishing attempts. However, these approaches often suffer from lower performance outcomes, such as lower accuracy, longer response times, and higher false positive rates. Furthermore, many existing methods rely heavily on predefined feature sets, which may limit their adaptability and robustness. In contrast, our proposed method leverages a more dynamic feature selection process, which includes the Conditional Wasserstein Generative Adversarial Network (CWGAN) for addressing data imbalance and the Binary Grey Goose Optimization Algorithm (BGGOA) for optimal feature selection. This dynamic approach enhances the model's ability to adapt to varying data characteristics, improving detection performance. The proposed solution is divided into two stages: pre-deployment and deployment. During the pre-deployment stage, the dataset is preprocessed, including data transformation, handling irrelevant and redundant data, and ensuring data balancing. Minority samples are increased using CWGAN to avoid class imbalance. Features are then selected using BGGOA, resulting in a feature-reduced dataset used for training and testing ensemble deep learning classifiers, specifically the Novel Pyramid Depth-wise Separable-MobileNetV3 (PyDS-MV3) and Deformable Convolutional Residual Neural Network (DCRNN), termed PDSMV3-DCRNN. During the deployment phase, the Boosted ConvNeXt approach extracts URL features fed into the trained classifier to predict "phishing" or "benign". According to experimental findings, the proposed solution outperforms all other tested approaches, displaying a faster training time of 0.11 seconds and achieving an optimal accuracy of 99.21%.

On Phishing URLs Detection Using Feature Extension

STFN: Spatio-Temporal Fusion Network to Detect Ethereum Phishing Scams

Web2Vec: Phishing Webpage Detection Method Based on Multidimensional Features Driven by Deep Learning

Phishing Detection Based on Multi-Feature Neural Network.

A hybrid DNN-LSTM model for detecting phishing URLs

Protect sensitive sites from phishing attacks using features extractable from inaccessible phishing URLs

Phishpedia: A Hybrid Deep Learning Based Approach to Visually Identify Phishing Webpages

Detecting Phishing sites Without Visiting them

Phishing Website Detection Based on Deep Convolutional Neural Network and Random Forest Ensemble Learning

A Large-Scale Pretrained Deep Model for Phishing URL Detection

An effective detection approach for phishing websites using URL and HTML features

An efficient multistage phishing website detection model based on the CASE feature framework: Aiming at the real web environment

Phishing Webpage Detection via Multi-Modal Integration of HTML DOM Graphs and URL Features Based on Graph Convolutional and Transformer Networks

CCBLA: a Lightweight Phishing Detection Model Based on CNN, BiLSTM, and Attention Mechanism

A Sophisticated Framework for the Accurate Detection of Phishing Websites

A Transformer-based Model to Detect Phishing URLs

An ensemble learning approach for detecting phishing URLs in encrypted TLS traffic

Phishing URL Detection using Machine Learning

Research on phishing webpage detection technology based on CNN-BiLSTM algorithm

PDSMV3-DCRNN: A Novel Ensemble Deep Learning Framework for Enhancing Phishing Detection and URL Extraction

A Malicious URL Detection Method Based on CNN