SPAE: Lifelong disk failure prediction via end-to-end GAN-based anomaly detection with ensemble update
Yu Liu,Yunchuan Guan,Tianming Jiang,Ke Zhou,Hua Wang,Guangxing Hu,Ji Zhang,Wei Fang,Zhuo Cheng,Ping Huang
DOI: https://doi.org/10.1016/j.future.2023.05.020
IF: 7.307
2023-06-19
Future Generation Computer Systems
Abstract:Disk failure prediction aims to predict upcoming disk failures in advance for high data reliability. There are numerous supervised machine learning methods that are successful in predicting disk failure using SMART properties as input. However, these approaches heavily rely on a substantial number of annotated failed disks, resulting in degraded prediction performance caused by scarce failed disks at the beginning, also known as the cold start problem. Inspired by the success achieved in Generative Adversarial Network (GAN) based anomaly detection, this paper translates disk failure prediction into an anomaly detection problem. Specifically, we developed a S emi-supervised method for lifelong disk failure P rediction via A dversarial training and E nsemble update, called SPAE. The advantage of SPAE over existing supervised approaches is that SPAE can train the prediction model using only healthy disks, avoiding the cold start problem. Furthermore, SPAE can be updated using ensemble learning on emerging failed disks to resist the model aging problem. Compared to state-of-the-art methods using supervised machine learning on real-world datasets, SPAE predicts disk failures with higher accuracy for the full lifetime of models, i . e ., both the startup period and the long-term usage.
computer science, theory & methods