Performance anomaly detection using isolation‐trees in heterogeneous workloads of web applications in computing clouds

Sara Kardani‐Moghaddam,Rajkumar Buyya,Kotagiri Ramamohanarao
DOI: https://doi.org/10.1002/cpe.5306
2019-04-16
Concurrency and Computation: Practice and Experience
Abstract:Cloud computing is a model for on‐demand access to shared resources based on the pay‐per‐use policy. In order to efficiently manage the resources, a continuous analysis of the operational state of the system is required to be able to detect the performance degradations and malfunctioned resources as soon as possible. Every change in the workload, hardware condition, or software code can change the state of the system from normal to abnormal, which causes the performance and quality of service degradations. These changes or anomalies vary from a simple gradual increase in the load to flash crowds, hardware faults, software bugs, etc. In this paper, we propose Isolation‐Forest based anomaly detection (IFAD) framework based on the unsupervised Isolation technique for anomaly detection in a multi‐attribute space of performance indicators for web‐based applications. Unsupervised nature of the algorithm and its fast execution make this algorithm most suitable for the environments with dynamic nature where the patterns of data change frequently. The experiment results demonstrate that IFAD can achieve good detection accuracy especially in terms of precision for multiple types of the anomaly. Moreover, we show the importance of validating the accuracy of anomaly detection algorithms with regard to both Area Under the Curve (AUC) and Precision‐Recall AUC (PRAUC) in an extensive set of comparisons including multiple unsupervised algorithms. The demonstration of the effectiveness of each algorithm shown by PRAUC results indicates the importance of PRAUC in selecting suitable anomaly detection algorithm, which is largely ignored in the literature.
What problem does this paper attempt to address?