Performance Impact Inference with Failures in Data Center Networks

Che Zhang, Hong Xu, Chengchen Hu
DOI: https://doi.org/10.1109/iccchina.2016.7636829
2016-01-01
Abstract: Maintaining a data center network (DCN) is crucial to the many services that run on top of it, especially given its scale of tens of thousands of network components. In this paper, we propose Sibyl, a method to infer performance changes in data center networks before failures actually happen. Unlike previous work, Sibyl relies on network topology information to infer network performance under failure scenarios, without the overhead of active measurements. Specifically, we demonstrate that most important performance metrics can be obtained from two fundamental topological metrics. We develop efficient algorithms to obtain these two fundamental metrics, leveraging the graph automorphism of various DCN topologies.
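To make the idea of topology-only inference concrete, here is a minimal Python sketch that compares one topological metric before and after a hypothetical link failure, without sending any probe traffic. The abstract does not name Sibyl's two fundamental metrics, so the average shortest-path length used here is an assumed stand-in. The leaf-spine topology, the node names, and all function names are likewise illustrative assumptions. This is not the paper's algorithm, and it does not exploit graph automorphism; it recomputes distances by brute-force breadth-first search.

```python
from collections import deque
from itertools import combinations


def leaf_spine(num_spines, num_leaves):
    """Build a hypothetical leaf-spine topology as an adjacency map."""
    spines = [f"S{i}" for i in range(1, num_spines + 1)]
    leaves = [f"L{i}" for i in range(1, num_leaves + 1)]
    adj = {node: set() for node in spines + leaves}
    for leaf in leaves:
        for spine in spines:
            adj[leaf].add(spine)
            adj[spine].add(leaf)
    return adj


def bfs_distances(adj, src):
    """Hop counts from src to every reachable node."""
    dist = {src: 0}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def avg_shortest_path(adj):
    """Average hop count over all connected node pairs (assumed metric)."""
    dists = {u: bfs_distances(adj, u) for u in adj}
    total, pairs = 0, 0
    for u, v in combinations(sorted(adj), 2):
        if v in dists[u]:  # skip pairs disconnected by the failure
            total += dists[u][v]
            pairs += 1
    return total / pairs if pairs else float("inf")


def without_link(adj, a, b):
    """Return a copy of the topology with the link a-b removed."""
    return {u: {v for v in nbrs if {u, v} != {a, b}} for u, nbrs in adj.items()}


topology = leaf_spine(num_spines=2, num_leaves=4)
baseline = avg_shortest_path(topology)
degraded = avg_shortest_path(without_link(topology, "L1", "S1"))
print(f"average path length: {baseline:.3f} -> {degraded:.3f}")
```

Recomputing all-pairs distances for every candidate failure scales poorly on a network with tens of thousands of components, which is presumably the cost that the paper's automorphism-based algorithms are meant to avoid.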