Abstract:With the rapid development of network technology, the Internet has brought significant convenience to various sectors of society, holding a prominent position. Due to the unpredictable and severe consequences resulting from malicious attacks, the detection of anomalous network traffic has garnered considerable attention from researchers over the past few decades. Accurately labeling a sufficient amount of network traffic data as a training dataset within a short period of time is a challenging task, given the rapid and massive generation of network traffic data. Furthermore, the proportion of malicious attack traffic is relatively small compared to the overall traffic data, and the distribution of traffic data across different types of malicious attacks also varies significantly. To address the aforementioned challenges, this paper presents a novel network anomaly detection algorithm based on semi-supervised learning and adaptive multiclass balancing. Building upon the assumption of consistent distribution between labeled and unlabeled data, this paper introduces the multiclass split balancing strategy and the adaptive confidence threshold function. These innovative approaches aim to tackle the issue of the multiclass imbalanced in traffic data. By leveraging the mutually beneficial relationship between semi-supervised learning and ensemble learning, this paper presents the collaborative rotation forest algorithm. This algorithm is specifically designed to enhance performance of anomaly detection in an environment with label inadequacy. Several comparative experiments conducted on the NSL-KDD, UNSW-NB15, and ToN-IoT demonstrate that the proposed algorithm achieves significant improvements in performance. Specifically, it enhances precision by 1.5–5.7%, recall by 1.5−5.7%, and F-Measure by 1.4−4.3% compared to the state-of-the-art algorithms.

A semi-supervised network traffic classification method based on incremental learning

A Cost-Sensitive Deep Learning-Based Approach for Network Traffic Classification

A novel pattern recognition algorithm: Combining ART network with SVM to reconstruct a multi-class classifier

An Incremental Learning Algorithm Based on Support Vector Domain Classifier

An Improved Network Traffic Classification Model Based on a Support Vector Machine

A New Network Traffic Classification Method Based on Classifier Integration

An SVM-based machine learning method for accurate internet traffic classification

DRnet: Dynamic Retraining for Malicious Traffic Small-Sample Incremental Learning

Network traffic classification based on federated semi-supervised learning

Semi-supervised tri-Adaboost algorithm for network intrusion detection

Accurate Classification of the Internet Traffic Based on the SVM Method

Traffic Classification - Towards Accurate Real Time Network Applications

A semi-supervised load identification method with class incremental learning

A network anomaly detection algorithm based on semi-supervised learning and adaptive multiclass balancing

Svm-Based Analysis and Prediction on Network Traffic

A Novel Malware Traffic Classification Method Using Semi-Supervised Learning.

Semi-supervised Dynamic Counter Propagation Network

A method of combining traffic classification and traffic prediction based on machine learning in wireless networks

An Online Incremental Semi-Supervised Learning Method

Towards Time-Varying Classification Based on Traffic Pattern

Multitask Learning for Network Traffic Classification