Abstract:Malware is one of the most popular cyber-attacks, and it is becoming more common on the network every day. In contrast to benign transmission, which typically exhibits symmetrical patterns, malware communication often shows asymmetrical behaviours, making detection a complex challenge. Fortunately, malware can be distinguished and identified for actual activities utilizing a variety of artificial intelligence methods. However, insufficient work has been allocated to the problem of handling high-dimensional and huge data. This paper proposes a novel deep learning-based approach to identify malicious Uniform Resource Locators (URLs) specifically designed to handle the challenges posed by large-scale and complex data. Initially, input data is sourced from a comprehensive Kaggle dataset, which includes diverse and large-scale URL samples. The URLs are then transformed into vector representations using a Vector Embedding Module, which employs a character-level word embedding technique to capture intricate patterns within the URLs. To further refine the data, the Chaotic Kookaburra Efficient-Bo Network (CKEBO-Net) is applied to extract the most significant features from these vectors, effectively reducing the dimensionality and computational burden. Subsequently, the Cascaded Capsule Twin Attentional Dilated Convolutional Network (C 2 TA_DiCN) model is introduced to classify and identify malicious URLs with high precision. This model leverages the unique strengths of capsule networks and attentional mechanisms, enhancing its capability to capture subtle dependencies within the data. Furthermore, the Lyrebird Meta-heuristic Optimization (LMO) algorithm is used to fine-tune the model parameters appropriately, ensuring that the training process is efficient and robust. The proposed approach is implemented using Python and rigorously evaluated on the Kaggle dataset. Simulation results demonstrate that the proposed method significantly outperforms existing models, achieving a malicious URL detection accuracy of 99.7%.

Bidirectional IndRNN malicious webpages detection algorithm based on convolutional neural network and attention mechanism.

CBF-IDS: Addressing Class Imbalance Using CNN-BiLSTM with Focal Loss in Network Intrusion Detection System

Malicious URL Detection Based on Improved Multilayer Recurrent Convolutional Neural Network Model

Detecting Malicious Web Requests Using an Enhanced TextCNN.

Accurate salient object detection via dense recurrent connections and residual-based hierarchical feature integration.

Research on phishing webpage detection technology based on CNN-BiLSTM algorithm

HDCBAN: Hybrid Neural Network for Network Intrusion Detection System

Malicious URL Detection via Pretrained Language Model Guided Multi-Level Feature Attention Network

Cascaded capsule twin attentional dilated convolutional network for malicious URL detection

Feature fusion-based malicious code detection with dual attention mechanism and BiLSTM

Mass fainting at rock concerts.

Phishing Websites Detection Via CNN and Multi-Head Self-Attention on Imbalanced Datasets

Network Intrusion Detection Method Based on CNN-BiLSTM-Attention Model

Malicious Code Classification Method Based on Deep Residual Network and Hybrid Attention Mechanism for Edge Security

Detecting phishing websites through improving convolutional neural networks with Self-Attention mechanism

Network Intrusion Detection Model Based on CNN and GRU

A Malicious URL Detection Method Based on CNN

A Hybrid Deep Learning Model for Malicious Behavior Detection

BiTCN-TAEfficientNet Malware Classification Approach Based on Sequence and RGB Fusion

TGA: A Novel Network Intrusion Detection Method Based on TCN, BiGRU and Attention Mechanism

Detecting command injection attacks in web applications based on novel deep learning methods