CIIR: an approach to handle class imbalance using a novel feature selection technique

Bidyapati Thiyam,Shouvik Dey
DOI: https://doi.org/10.1007/s10115-024-02126-2
IF: 2.7
2024-05-24
Knowledge and Information Systems
Abstract:The increasing vulnerability of systems and the rise in malicious events have sparked concerns about network security. In order to address these threats, network intrusion detection systems (NIDSs) play a role in protecting against malicious threats. However, IDSs often face obstacles, like the issue of imbalanced classes, which can hinder the effectiveness of machine learning models by giving preference to the majority class. To resolve this issue, many strategies such as resampling, cost-sensitive, and ensemble learning systems have been proposed, but no relevant metrics have been developed to investigate the influence of observed performance on the data-level approach. The proposed model introduced a new metric to study the impact of sampling for the classification algorithm. This paper presents a novel approach known as the CI IR (Causal Inference Imbalanced Ratio ) by utilizing ADASYN-IHT with Boruta-ROC feature selection in conjunction with four well-known imbalanced datasets: CIC-DDoS2019, UNSW-NB15, ML-EdgeIIoT and WUSTL-IIoT2021. The experimental outcomes prove the efficacy of the ADASYN-IHT and Boruta-ROC methods in improving classification performance on these datasets and by studying the impact of the CI IR .
computer science, information systems, artificial intelligence
What problem does this paper attempt to address?