A Failure Detection Solution for Multiple QoS in Data Center Networks

Kai Shen,Renke Wu,Haojie Zhou,Haibo Yu,Hao Zhong
DOI: https://doi.org/10.1145/2993717.2993728
2016-01-01
Abstract:Failures in data center networks sometimes can lead to user perceived service interruptions. Automated failure detection is needed to maintain the reliability of data centers. However, researches rarely identify quality of service (QoS) multiplicity for failure detection in data center networks.In this paper, to tackle this problem, we first divide network devices into two categories: imperative devices whose failures need to be detected in realtime, and non-imperative ones. Consequently, we leverage a co-detection approach named K-detectors and a data mining based approach to detect failures of these two kinds of devices respectively.We evaluated our approach on a simulated network built by ns-3. The experimental results show that for servers, query accuracy probability improves 4.62% with detection time increasing slightly; for links, discrimination improves significantly (nearly 86%).
What problem does this paper attempt to address?