Anomaly Detection in the Open World: Normality Shift Detection, Explanation, and Adaptation.

Dongqi Han,Zhiliang Wang,Wenqi Chen,Kai Wang,Rui Yu,Su Wang,Han Zhang,Zhihua Wang,Minghui Jin,Jiahai Yang,Xingang Shi,Xia Yin
DOI: https://doi.org/10.14722/ndss.2023.24830
2023-01-01
Abstract:Concept drift is one of the most frustrating challenges for learning-based security applications built on the closeworld assumption of identical distribution between training and deployment.Anomaly detection, one of the most important tasks in security domains, is instead immune to the drift of abnormal behavior due to the training without any abnormal data (known as zero-positive), which however comes at the cost of more severe impacts when normality shifts.However, existing studies mainly focus on concept drift of abnormal behaviour and/or supervised learning, leaving the normality shift for zero-positive anomaly detection largely unexplored.In this work, we are the first to explore the normality shift for deep learning-based anomaly detection in security applications, and propose OWAD, a general framework to detect, explain, and adapt to normality shift in practice.In particular, OWAD outperforms prior work by detecting shift in an unsupervised fashion, reducing the overhead of manual labeling, and providing better adaptation performance through distribution-level tackling.We demonstrate the effectiveness of OWAD through several realistic experiments on three security-related anomaly detection applications with long-term practical data.Results show that OWAD can provide better adaptation performance of normality shift with less labeling overhead.We provide case studies to analyze the normality shift and provide operational recommendations for security applications.We also conduct an initial real-world deployment on a SCADA security system.1 Normality shift intuitively refers to the change of distribution of normal data (detailed definition is in §II-C).In this paper, we interchangeably use terms "drift" and "shift".We tend to use "normality shift" as a whole term.
What problem does this paper attempt to address?