Abstract: Recent advances in Internet and communication technologies have caused network systems and data to evolve rapidly. The emergence of new attacks jeopardizes network security and makes intrusions challenging to detect, and multiple network attacks by an intruder are unavoidable. Our research targets the critical issue of class imbalance in intrusion detection, which reflects the real-world scenario in which legitimate network activities significantly outnumber malicious ones. This imbalance can adversely affect the learning process of predictive models, often resulting in high false-negative rates, a major concern in intrusion detection systems (IDSs). By focusing on datasets with this imbalance, we aim to develop and refine advanced algorithms and techniques, such as anomaly detection, cost-sensitive learning, and oversampling methods, that handle such disparities effectively. The primary goal is to create models that are highly sensitive to intrusions while minimizing false alarms, an essential aspect of an effective IDS. This approach is practical for real-world applications and also enhances the theoretical understanding of managing class imbalance in machine learning. By addressing these challenges, our research aims to contribute to cybersecurity with insights and applicable solutions for countering digital threats and for keeping IDS development robust and relevant. An IDS checks network traffic for security, availability, and confidentiality. Despite the efforts of many researchers, contemporary IDSs still need to improve detection accuracy, reduce false alarms, and detect new intrusions. Our hybrid model, called MCL-FWA-BILSTM, combines a mean convolutional layer (MCL), feature-weighted attention (FWA) learning, a bidirectional long short-term memory (BiLSTM) network, and the random forest algorithm.
After preprocessing, the data are passed to the CNN-MCL layer for feature extraction. Feature vectors are obtained after the convolution, pooling, and flattening phases. The proposed method uses the BiLSTM and self-attention feature weights to mitigate the effects of class imbalance. The attention-layer and BiLSTM features are concatenated to create mapped features, which are then fed to the random forest algorithm for classification. Our methodology and model performance were validated on NSL-KDD and UNSW-NB15, two widely available IDS datasets. On the NSL-KDD dataset, the proposed model achieves accuracies of 99.67% and 99.88% on the binary and multi-class classification tasks, respectively. On the UNSW-NB15 dataset, its binary and multi-class classification accuracies are 99.56% and 99.45%, respectively. We also compared the proposed approach with previous machine learning and deep learning models and found that it outperforms them in detection rate, false-positive rate (FPR), and F-score. For both binary and multi-class classification, the proposed method reduces false positives while increasing the number of true positives. The model identifies diverse intrusions on computer networks and accomplishes its intended purpose. It should be useful in a variety of network security research fields and applications.
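The abstract reports detection rate, FPR, and F-score alongside accuracy. A minimal sketch of why this matters under class imbalance is given below. It is illustrative only and is not the paper's evaluation code: the helper names (`count_outcomes`, `ids_metrics`) and the toy labels (95 normal flows, 5 attacks) are assumptions made up for this example.

```python
from dataclasses import dataclass


@dataclass
class BinaryCounts:
    tp: int  # attacks correctly flagged
    fp: int  # normal traffic flagged as attack (false alarms)
    tn: int  # normal traffic correctly passed
    fn: int  # attacks missed


def count_outcomes(y_true, y_pred, positive=1):
    """Tally confusion-matrix cells, treating `positive` as the attack class."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return BinaryCounts(tp, fp, tn, fn)


def ids_metrics(c):
    """Accuracy, detection rate (recall), FPR, and F-score from the counts."""
    def ratio(num, den):
        return num / den if den else 0.0

    detection_rate = ratio(c.tp, c.tp + c.fn)
    precision = ratio(c.tp, c.tp + c.fp)
    return {
        "accuracy": ratio(c.tp + c.tn, c.tp + c.fp + c.tn + c.fn),
        "detection_rate": detection_rate,
        "fpr": ratio(c.fp, c.fp + c.tn),
        "f_score": ratio(2 * precision * detection_rate,
                         precision + detection_rate),
    }


# Hypothetical imbalanced toy data: 95 normal flows (0) and 5 attacks (1).
y_true = [0] * 95 + [1] * 5

# A degenerate detector that labels everything "normal".
always_normal = [0] * 100
baseline = ids_metrics(count_outcomes(y_true, always_normal))

# A detector that raises 5 false alarms but catches 4 of the 5 attacks.
y_pred = [1] * 5 + [0] * 90 + [1] * 4 + [0]
detector = ids_metrics(count_outcomes(y_true, y_pred))

print(baseline)
print(detector)
```

On this toy data the degenerate detector has the higher accuracy even though it misses every attack, while the second detector has a much higher detection rate and F-score. Accuracy alone therefore says little about an IDS when the classes are imbalanced.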

Shafiq, M and Yu, X and Bashir, AK and Chaudhry, HN and Wang, D (2018) A Machine Learning Approach for Feature Selection Traffic Classification Using Security

An Efficient Traffic Classification Scheme Using Embedded Feature Selection and LightGBM

Optimizing Feature Selection for Efficient Encrypted Traffic Classification: A Systematic Approach

Network Intrusion Detection Through Discriminative Feature Selection by Using Sparse Logistic Regression

A Novel Method for Feature Learning and Network Intrusion Classification

Leveraging Metaheuristics for Feature Selection With Machine Learning Classification for Malicious Packet Detection in Computer Networks

Hybrid feature selection-based machine learning Classification system for the prediction of injury severity in single and multiple-vehicle accidents

Improved Feature Selection and Stream Traffic Classification Based on Machine Learning in Software-Defined Networks

A Machine Learning-Based Framework with Enhanced Feature Selection and Resampling for Improved Intrusion Detection

Explainable artificial intelligence for feature selection in network traffic classification: A comparative study

IP Traffic Classification Based on Machine Learning

An Improved Network Traffic Classification Model Based on a Support Vector Machine

An SVM-based machine learning method for accurate internet traffic classification

Enhancing Efficiency and Privacy in Memory-Based Malware Classification through Feature Selection

Performance analysis of machine learning models for intrusion detection system using Gini Impurity-based Weighted Random Forest (GIWRF) feature selection technique

Impact of Feature Selection Methods on Data Classification for IDS

A hybrid feature weighted attention based deep learning approach for an intrusion detection system using the random forest algorithm

Fast and Robust Online Traffic Classification Supporting Unseen Applications

Classification of Firewall Log Data Using Multiclass Machine Learning Models

A Multimodal Network Security Framework for Healthcare Based on Deep Learning