Learning Spatial Graph Structure for Multivariate KPI Anomaly Detection in Large-Scale Cyber-Physical Systems.

Haiqi Zhu,Seungmin Rho,Shaohui Liu,Feng Jiang
DOI: https://doi.org/10.1109/tim.2023.3284920
IF: 5.6
2023-01-01
IEEE Transactions on Instrumentation and Measurement
Abstract:Anomaly detection on multivariate key performance indicators (KPIs) is a key procedure for the quality and reliability of large-scale cyber-physical systems (CPSs). Although extensive efforts have been paid in learning normal data distributions, the spatial dependence of different dimensional KPIs is barely explored to reasonably represent the complexity and time-varying nature of systems. In this article, we propose to model the spatial dependence of multivariate KPI by combining a more reasonable graph learning method with a graph attention mechanism to obtain the complex spatial dependence in an unsupervised manner. First, we transform the multivariate KPI into graph structures with a specially designed KPI graph learning module. Second, the graph attention mechanism extracts the spatial dependence in the KPI graphs. Finally, our method jointly trains forecasting-based model and reconstruction-based model to detect anomalies. Through a large number of related experiments on four real-world datasets, we demonstrate the feasibility of our method and the F1-score improves by 9% over the baseline model. Further analysis shows that the graph learning method in this article can more reasonably describe the dependence between multivariate KPI, and the graph attention mechanism can more accurately capture the correlation between them, which is helpful for fault diagnosis.
What problem does this paper attempt to address?