Log-Based OpenStack Fault Diagnosis by Machine Learning

Leyi Zhang, Lei Fan, Naiwang Guo
DOI: https://doi.org/10.1088/1742-6596/1069/1/012111
2018-01-01
Journal of Physics Conference Series
Abstract: With the rapid development of cloud computing, OpenStack has become widely used. Because failures occur frequently on cloud platforms, detecting OpenStack system failures is an important problem. This paper proposes a fault diagnosis algorithm that requires only the raw logs. The raw logs are first converted to a unified format and stored in a database. A well-designed time window is then used to extract features, and the extracted features are used to locate the time period of the fault in the logs. By analyzing the log fragment from that period, the algorithm determines which OpenStack components are responsible for the failure and then identifies the detailed cause. Experimental results show that the proposed algorithm performs well in detecting OpenStack failures.
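The pipeline described in the abstract (unify logs, extract features per time window, locate the fault period, attribute it to components) can be illustrated with a minimal Python sketch. Everything specific here is an assumption and not taken from the paper: the log line layout (`timestamp pid LEVEL module ...`), the 60-second window, the function names, and above all the simple error-count threshold, which only stands in for the machine-learning step that the abstract does not detail.

```python
import re
from collections import Counter, defaultdict
from datetime import datetime, timedelta

# Assumed unified line layout: "YYYY-MM-DD HH:MM:SS.mmm PID LEVEL module rest..."
LINE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})(?:\.\d+)? "
    r"(?P<pid>\d+) (?P<level>[A-Z]+) (?P<module>[\w.]+)"
)
EPOCH = datetime(1970, 1, 1)


def parse_line(line):
    """Return (timestamp, level, component), or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    ts = datetime.strptime(m.group("ts"), "%Y-%m-%d %H:%M:%S")
    # Treat the first dotted segment of the module name as the component.
    component = m.group("module").split(".")[0]
    return ts, m.group("level"), component


def window_features(lines, window_seconds=60):
    """Count (component, level) pairs per fixed-size time window."""
    features = defaultdict(Counter)
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue  # skip lines that do not follow the assumed layout
        ts, level, component = parsed
        offset = int((ts - EPOCH).total_seconds())
        start = EPOCH + timedelta(seconds=offset - offset % window_seconds)
        features[start][(component, level)] += 1
    return dict(features)


def suspect_windows(features, min_errors=3):
    """Hypothetical stand-in for the classifier: flag windows with many ERROR lines.

    Returns (window_start, per-component ERROR counts) pairs in time order.
    """
    suspects = []
    for start in sorted(features):
        errors = Counter()
        for (component, level), count in features[start].items():
            if level == "ERROR":
                errors[component] += count
        if sum(errors.values()) >= min_errors:
            suspects.append((start, errors))
    return suspects
```

Calling `suspect_windows(window_features(lines))` on an iterable of log lines yields the flagged windows, and `errors.most_common(1)` on each result names the component with the most errors in that window. A real system would replace the threshold with a trained model over the per-window feature vectors.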