A comparative analysis of feature selection for IoT device identification in machine learning algorithms

Zhiming Guo
DOI: https://doi.org/10.54254/2755-2721/73/20240373
2024-07-05
Abstract:In recent years, IoT security has become a critical concern, prompting the integration of artificial intelligence and cybersecurity. State-of-the-art research has focused on applying machine learning and deep learning techniques, emphasizing the crucial role of data preprocessing for effective results. One of the most common and significant data preparation techniques, feature selection is now an essential step in the machine learning process. It involves finding pertinent characteristics and deleting unimportant, low-correlation, redundant, or noisy data. In peoples daily lives, noise can interfere with the sound of our normal lives. Similarly, in machine learning, they can also interfere with the process of correctly recognizing the laws of things, leading to the influence of many parameters such as learning rate. This feature selection process improves predictive accuracy and increases comprehensibility. This paper evaluates the performance of four machine learning algorithms for embedded feature selection and compares them. In the end, it was found that XGBoost outperformed the other three algorithms in terms of performance. After feature selection, the accuracy remained almost unchanged, and even with only a few features (about 10% of the original data), it still had a high accuracy
What problem does this paper attempt to address?