An Online Performance Anomaly Detector in Cluster File Systems

Xin Chen,Xubin He,He Guo,Yuxin Wang
DOI: https://doi.org/10.1109/paap.2010.26
2010-01-01
Abstract:Performance problems, which can stem from different system components, such as network, memory, and storage devices, are difficult to diagnose and isolate in a cluster file system. In this paper, we present an online performance anomaly detector which is able to efficiently detect performance anomaly and accurately identify the faulty sources in a system node of a cluster file system. Our method exploits the stable relationship between workloads and system resource statistics to detect the performance anomaly and identify faulty sources which cause the performance anomaly in the system. Our preliminary experimental results demonstrate the efficiency and accuracy of the proposed performance anomaly detector.
What problem does this paper attempt to address?