Image-based malware detection based on convolution neural network with autoencoder in Industrial Internet of Things using Software Defined Networking Honeypot

Sanjeev Kumar,Anil Kumar
DOI: https://doi.org/10.1016/j.engappai.2024.108374
IF: 8
2024-04-13
Engineering Applications of Artificial Intelligence
Abstract:Technology-enabled intelligent automation has ushered in a new era of Industry 4.0. At the core of this revolution are many interconnected devices, commonly referred to as the Industrial Internet of Things (IIoT). On the one hand, these sensors have resulted in productivity improvements, but at the same time, they have created a new attack surface for the threat actors. This research presents a novel image-based malware detection system leveraging software-defined networking (SDN) honeypot, convolution neural networks (CNN), and a two-level autoencoder. The proposed system transforms binary programs into gray-scale images and extracts textural features using deep CNN architectures via transfer learning. The dimensions of extracted features are then reduced using a custom two-level autoencoder. Finally, these reduced features are fed to six algorithms, including deep learning (DL) and machine learning (ML) classifiers. This study performs hyper-parameter optimization of each classifier using automated grid-search and random-search algorithms. The experimental results performed using the public benchmarked MalImg dataset show that the proposed method outperforms existing approaches, achieving high test accuracy of 98.50% and weighted precision, recall, and f1-score of 98.60%, 98.60%, and 98.60%, respectively. Furthermore, the mean response time is reported as 0.006 s, indicating the fast detection capabilities of the proposed approach. One of the advantages of the proposed methodology is that it reduces the dependence on intensive feature engineering and minimizes the need for domain expertise. The proposed method offers a scalable and industry-suitable solution for detecting malware in IIoT environments without relying on computationally intensive techniques.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary
What problem does this paper attempt to address?