Performance problems online detection in cloud computing systems via analyzing request execution paths

Haibo Mi,Huaimin Wang,Gang Yin,Hua Cai,Qi Zhou,Tingtao Sun
DOI: https://doi.org/10.1109/DSNW.2011.5958799
2011-01-01
Abstract:It is quite a headache for developers to online detect performance problems in large-scale cloud computing systems. The behavior and the hidden connections among the huge amount of runtime request execution paths in cloud computing systems usually contain useful information for performance problem detection. In this paper, we propose an approach to rapidly diagnose the source of performance degradation in large-scale non-stop cloud computing systems. The approach first groups the user requests into categories with a fast clustering algorithm; then applies the principal components analysis to extract the primary methods; finally compares the normal and abnormal behaviors of the primary methods to localize the main cause of performance problems. We conduct extensive experiments over a real-world enterprise system providing services for the public. The results show that our approach can locate the prime causes of performance problems accurately and efficiently.
What problem does this paper attempt to address?