Automated Known Problem Diagnosis with Event Traces

Chun Yuan,Ni Lao,Ji-Rong Wen,Jiwei Li,Zheng Zhang,Yi-Min Wang,Wei-Ying Ma
DOI: https://doi.org/10.1145/1217935.1217972
2006-01-01
ACM SIGOPS Operating Systems Review
Abstract:Computer problem diagnosis remains a serious challenge to users and support professionals. Traditional troubleshooting methods relying heavily on human intervention make the process inefficient and the results inaccurate even for solved problems, which contribute significantly to user's dissatisfaction. We propose to use system behavior information such as system event traces to build correlations with solved problems, instead of using only vague text descriptions as in existing practices. The goal is to enable automatic identification of the root cause of a problem if it is a known one, which would further lead to its resolution. By applying statistical learning techniques to classifying system call sequences, we show our approach can achieve considerable accuracy of root cause recognition by studying four case examples.
What problem does this paper attempt to address?