A Transformer-Based Framework for Payload Malware Detection and Classification

Kyle Stein,Arash Mahyari,Guillermo Francia III,Eman El-Sheikh

2024-03-27

Abstract:As malicious cyber threats become more sophisticated in breaching computer networks, the need for effective intrusion detection systems (IDSs) becomes crucial. Techniques such as Deep Packet Inspection (DPI) have been introduced to allow IDSs analyze the content of network packets, providing more context for identifying potential threats. IDSs traditionally rely on using anomaly-based and signature-based detection techniques to detect unrecognized and suspicious activity. Deep learning techniques have shown great potential in DPI for IDSs due to their efficiency in learning intricate patterns from the packet content being transmitted through the network. In this paper, we propose a revolutionary DPI algorithm based on transformers adapted for the purpose of detecting malicious traffic with a classifier head. Transformers learn the complex content of sequence data and generalize them well to similar scenarios thanks to their self-attention mechanism. Our proposed method uses the raw payload bytes that represent the packet contents and is deployed as man-in-the-middle. The payload bytes are used to detect malicious packets and classify their types. Experimental results on the UNSW-NB15 and CIC-IOT23 datasets demonstrate that our transformer-based model is effective in distinguishing malicious from benign traffic in the test dataset, attaining an average accuracy of 79\% using binary classification and 72\% on the multi-classification experiment, both using solely payload bytes.

Cryptography and Security,Artificial Intelligence,Machine Learning

What problem does this paper attempt to address?

The paper aims to address the problem of malware detection and classification in Intrusion Detection Systems (IDS). Specifically, the researchers propose a Transformer-based Deep Packet Inspection (DPI) algorithm to analyze the content of network packets to identify malicious traffic and further classify this malicious traffic. The main contributions of the paper include: 1. **Proposing a new DPI algorithm**: This algorithm is based on the Transformer model and can capture complex patterns and dependencies in the raw payload bytes of network packets through the self-attention mechanism. 2. **Effectively handling the challenges posed by encrypted traffic**: Although encrypted traffic hides useful information in packets, this method performs well on unencrypted data, effectively distinguishing between malicious and benign traffic. 3. **Experimental validation**: The research used two well-known and widely used datasets, UNSW-NB15 and CIC-IOT23, for experimental validation. The results show that in binary classification tasks, the proposed model achieved an average accuracy of 79%, and in multi-class classification tasks, the accuracy was 72%. Additionally, the paper discusses the impact of encrypted traffic on malware detection, pointing out that encryption algorithms like AES can effectively hide patterns in packets, while some encryption algorithms may not be strong enough and could still reveal characteristics of malware. Finally, by comparing the performance of different models (including 1D Convolutional Neural Networks (1D-CNN), 2D Convolutional Neural Networks (2D-CNN), and Long Short-Term Memory Networks (LSTM)), it is demonstrated that the proposed Transformer-based method has higher accuracy and robustness in malware detection.

A Transformer-Based Framework for Payload Malware Detection and Classification

Revolutionizing Payload Inspection: A Self-Supervised Journey to Precision with Few Shots

A Lean Transformer Model for Dynamic Malware Analysis and Detection

Survey of Transformer-Based Malicious Software Detection Systems

Transformer-Based Malicious Traffic Detection for Internet of Things

TransMalDE: an Effective Transformer Based Hierarchical Framework for IoT Malware Detection

Channel Features and API Frequency-Based Transformer Model for Malware Identification

MalBERT: Using Transformers for Cybersecurity and Malicious Software Detection

AdaTrans: An adaptive transformer for IoT Malware detection based on sensitive API call graph and inter-component communication analysis

Enhanced Image-Based Malware Classification Using Transformer-Based Convolutional Neural Networks (CNNs)

EarlyMalDetect: A Novel Approach for Early Windows Malware Detection Based on Sequences of API Calls

Interpretable Detection of Malicious Behavior in Windows Portable Executables Using Multi-Head 2D Transformers

Malicious Source Code Detection Using Transformer

Towards Novel Malicious Packet Recognition: A Few-Shot Learning Approach

Deep learning-based improved transformer model on android malware detection and classification in internet of vehicles

An Efficient DenseNet-Based Deep Learning Model for Malware Detection

DeepImageDroid: A Hybrid Framework Leveraging Visual Transformers and Convolutional Neural Networks for Robust Android Malware Detection

Cyber-Threat Detection System Using a Hybrid Approach of Transfer Learning and Multi-Model Image Representation

CyberSentinel: A Transparent Defense Framework for Malware Detection in High-Stakes Operational Environments

A Convolutional Transformation Network for Malware Classification

Accelerating Malware Classification: A Vision Transformer Solution