Design and Evaluation of an Online Anomaly Detector for Distributed Storage Systems.

Xin Chen, Xubin He, He Guo, Yuxin Wang
DOI: https://doi.org/10.4304/jsw.6.12.2379-2390
2011-01-01
Abstract: Performance problems in distributed storage systems are difficult to diagnose and isolate because they may stem from different system components, such as the network, memory, and storage devices. In this paper, we present a performance anomaly detector that efficiently detects performance anomalies and accurately identifies the faulty sources within a system node of a distributed storage system. Our method exploits the stable relationship between workloads and system resource statistics to detect performance anomalies and to identify the faulty sources that cause them. Our experimental results demonstrate the efficiency and accuracy of the proposed performance anomaly detector.
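
The abstract does not give the model itself, so the following is only a minimal sketch of the general idea of exploiting a stable workload-to-resource relationship. It assumes a linear relationship fitted by least squares on fault-free data, a residual threshold of three standard deviations, and hypothetical names (`fit_line`, `train`, `detect`, and the resource labels `disk` and `net`); none of these come from the paper.

```python
from statistics import mean, pstdev


def fit_line(xs, ys):
    """Least-squares fit of y = a * x + b (assumed model, not the paper's)."""
    mx, my = mean(xs), mean(ys)
    var = sum((x - mx) ** 2 for x in xs)
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var
    return a, my - a * mx


def train(workload, resources):
    """Learn one workload-to-statistic line per resource from fault-free data.

    resources maps a resource name to samples aligned with workload.
    """
    model = {}
    for name, ys in resources.items():
        a, b = fit_line(workload, ys)
        residuals = [y - (a * x + b) for x, y in zip(workload, ys)]
        sigma = pstdev(residuals) or 1e-9  # avoid division by zero
        model[name] = (a, b, sigma)
    return model


def detect(model, load, sample, threshold=3.0):
    """Return (is_anomalous, suspect) for a single online observation.

    The suspect is the resource whose statistic deviates most, in units of
    its baseline residual spread, from what the workload predicts.
    """
    scores = {
        name: abs(sample[name] - (a * load + b)) / sigma
        for name, (a, b, sigma) in model.items()
    }
    suspect = max(scores, key=scores.get)
    if scores[suspect] > threshold:
        return True, suspect
    return False, None


# Hypothetical baseline: request rate versus two resource statistics.
workload = [10, 20, 30, 40, 50]
baseline = {
    "disk": [21, 39, 61, 79, 101],
    "net": [5.1, 9.9, 15.2, 19.8, 25.0],
}
model = train(workload, baseline)
print(detect(model, 30, {"disk": 60.0, "net": 15.0}))
print(detect(model, 30, {"disk": 90.0, "net": 15.0}))
```

In this sketch, an observation is flagged only when a resource statistic departs from what the current workload predicts, so a legitimately heavy workload does not by itself raise an alarm. The resource with the largest normalized deviation is reported as the likely faulty source.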