The Infuence of Noisy Data on Skype Traffic Classification

Linhua Niu,Xiangzhan Yu,Zhimin Yin
DOI: https://doi.org/10.2991/icaise.2013.52
2013-01-01
Abstract:Because of its popularity, encrypted traffic and proprietary design, there has been difficult to detect Skype from other P2P traffics. The research of Skype traffic identification focuses on collecting traffic flow feature and using machine learning method to identification. The key of machine learning method is datasets and flow feature selection. Since there is no publicly available datasets, noisy data can't be avoided. In this paper, I compare two different machine learning classification techniques, C4.5 and Neural Networks. Results show that C4.5 is better than Neural Networks when noisy data percent is low and Neural Networks is steady when noisy data percent is high.
What problem does this paper attempt to address?