CNN based IoT Device Identification

Kahraman Kostas
2023-04-27
Abstract:While the use of the Internet of Things is becoming more and more popular, many security vulnerabilities are emerging with the large number of devices being introduced to the market. In this environment, IoT device identification methods provide a preventive security measure as an important factor in identifying these devices and detecting the vulnerabilities they suffer from. In this study, we present a method that identifies devices in the Aalto dataset using the convolutional neural network (CNN).
Cryptography and Security,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the issue of security vulnerability identification brought about by the significant increase in Internet of Things (IoT) devices in the market. Specifically, with the proliferation of IoT devices, many new security threats have also emerged. To tackle this challenge, the authors propose a method based on Convolutional Neural Networks (CNN) to identify the identities of different IoT devices in the Aalto dataset. By converting network data into pseudo-image form and then analyzing it using CNN, this method can serve as a preventive security measure to help identify these devices and their potential security risks. The paper mainly discusses how to transform network traffic into pseudo-images and demonstrates the specific implementation of this process through different research cases. For example, Lim et al. generated fixed-size pseudo-images by converting the payload part of network data packets into binary digits; Lotfollahi et al. obtained uniform-sized pseudo-images for CNN analysis by standardizing and zero-padding the data packets. Additionally, Wang et al. converted different types of sessions and application layer data into 784-pixel pseudo-images to input into the CNN model. The authors of this paper chose a method similar to that of Wang et al., extracting the first 784 bytes of each data packet's content as input and focusing only on the payload part rather than other header information to maintain data consistency. Ultimately, the authors compared the performance of the payload-based CNN model with the fingerprint feature set-based CNN model. The results showed that both were very close in accuracy, but the fingerprint-based method was significantly faster in processing speed.