Automatic Detecting Performance Bugs in Cloud Computing Systems via Learning Latency Specification Model

Haibo Mi,Huaimin Wang,Zhenbang Chen,Yangfan Zhou
DOI: https://doi.org/10.1109/SOSE.2014.43
2014-01-01
Abstract:Performance bugs that don't cause fail-stop errors but degradation of system performance have been one of the most fundamental issues in the production platform. How to effectively online detect bugs becomes more and more urgent for engineers. Performance bugs usually manifest themselves as the anomalous call structures of request traces or anomalous latencies of invoked methods. In this paper, we propose an automatic performance bug online detecting approach, CloudDoc. CloudDoc maintains a performance model mined from execution traces that are collected in the normal period. The performance model captures the characteristics of call structures of request traces together with corresponding latencies. With the performance model, CloudDoc periodically detects whether performance bugs occur or not. All suspicious call structures or latency-abnormal invoked methods are presented to engineers. We report two case studies to demonstrate the effectiveness of CloudDoc in helping engineers identify performance bugs.
What problem does this paper attempt to address?