Advances in Machine Learning Based Network Traffic Classification

WANG Tao,YU Shun-zheng
DOI: https://doi.org/10.3969/j.issn.1000-1220.2012.05.021
2012-01-01
Abstract:ML(machine learning) employs statistical network flow characteristics to assist in the IP traffic classification identification and classification,which is different with traditional methods that depend on well known application port numbers or deeply inspecting the contents of packet payloads.ML-based network traffic classification has been researched widely and developed rapidly.This survey reviews the significant works that cover the dominant period since 2004,and categorize,analyze and compare them according to their choice of ML strategies which include supervised,unsupervised and semi-supervised learning algorithms.We importantly discuss the orientations and challenges for the employment of ML-based traffic classifiers in operational IP networks.More specifically,the key issues such as sample labeling bottleneck,skewed data distribution,real-time and continuous classification and scalability of classification algorithms are discussed.
What problem does this paper attempt to address?