FluxInfer: Automatic Diagnosis of Performance Anomaly for Online Database System

Ping Liu,Shenglin Zhang,Yongqian Sun,Yuan Meng,Jiahai Yang,Dan Pei
DOI: https://doi.org/10.1109/ipccc50635.2020.9391550
2020-01-01
Abstract:The root cause diagnosis of performance anomaly for online database anomalies is challenging due to diverse types of database engines, different operational modes, and variable anomaly patterns. To relieve database operators from manual anomaly diagnosis and alarm storm, we propose FluxInfer, a framework to accurately and rapidly localize root cause related KPIs for database performance anomaly. It first constructs a Weighted Undirected Dependency Graph (WUDG) to represent the dependency relationships of anomalous KPIs accurately, and then applies a weighted PageRank algorithm to localize root cause related KPIs. The testbed evaluation experiments show that the AC@3, AC@5, and Avg@5 of FluxInfer are 0.90, 0.95, and 0.77, outperforming nine baselines by 64%, 60%, and 53% on average, respectively.
What problem does this paper attempt to address?