Host identification based on self-similarity of network activity

Lisheng Huang,Guanling Zhao,Lu Li,Fengjun Zhang
DOI: https://doi.org/10.1016/j.comcom.2022.05.017
IF: 5.047
2022-07-01
Computer Communications
Abstract:The randomness and variability of IP addresses challenge the identity uniqueness of internet hosts. Accurately identifying internet hosts on the premise of protecting users’ privacy is difficult. In this paper, we demonstrate that the network behaviour characteristics of internet hosts often have self-similar characteristics, and propose a new method for host identification based on network activity self-similarity (NASS). In this method, the multidimensional behaviour features of internet hosts are collected from network traffic, and the time series of behaviour features are constructed. Then, after noise reduction, Mahalanobis distance is applied to measure the distance between the time series of different time windows of any two hosts. The distance measurement results are ranked, and host identification is realized according to the ranking. NASS can accurately identify the network host without violating user privacy and is suitable for encrypted communication environments. The experimental results show that the accuracy of NASS is 83.67%.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?