Hard Anomaly Detection: an Adversarial Data Augmentation Solution

Hu Teng, Cheng Wang, Qing Yang, Xue Chen
DOI: https://doi.org/10.1109/icdmw60847.2023.00140
2023-01-01
Abstract: We propose a measure that divides anomaly detection into four categories, and we focus on one special and representative category, which we call hard anomaly detection. In ordinary anomaly detection, anomalies usually appear as outliers. A hard anomaly, by contrast, is characterized by two properties of the difference between abnormal and normal samples: 1) the boundary between them is very indistinct, and 2) their distribution is extremely imbalanced. Both properties make detection difficult, and regular classifiers struggle to find the boundary between abnormal and normal samples. To address hard anomaly detection, we first propose a quantitative definition of the hard anomaly according to detection difficulty and then design a dedicated solution framework, named Hard Anomaly Detection (HAD). Under the HAD framework, we devise a method based on a Generative Adversarial Network (GAN), called HadGAN, which produces both abnormal and normal samples with a similar distribution. We then pre-train a base anomaly detection model on the data generated by HadGAN and apply transfer learning to fine-tune the base model on real datasets. The superiority of our solution is demonstrated both by theoretical analysis and by extensive experiments.
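The pre-train-then-fine-tune workflow described in the abstract can be illustrated with a minimal sketch. This is not the authors' HadGAN or their base model: a hypothetical Gaussian sampler stands in for the generator output, a plain logistic regression stands in for the base anomaly detector, and every name and parameter (`sample`, `train_logreg`, `shift`, the learning rates) is an assumption made for illustration only.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logreg(X, y, w=None, b=0.0, lr=0.1, epochs=200):
    """Gradient descent on the logistic loss.

    Passing w and b continues training from an existing model,
    which is how the fine-tuning step is imitated here.
    """
    n, d = X.shape
    w = np.zeros(d) if w is None else w.copy()
    for _ in range(epochs):
        grad = sigmoid(X @ w + b) - y      # per-sample gradient w.r.t. the logit
        w -= lr * (X.T @ grad) / n
        b -= lr * grad.mean()
    return w, b

def sample(rng, n_normal, n_abnormal, shift):
    """Hypothetical data: two overlapping Gaussians (an indistinct boundary)."""
    X_normal = rng.normal(0.0, 1.0, size=(n_normal, 2))
    X_abnormal = rng.normal(shift, 1.0, size=(n_abnormal, 2))
    X = np.vstack([X_normal, X_abnormal])
    y = np.concatenate([np.zeros(n_normal), np.ones(n_abnormal)])
    return X, y

rng = np.random.default_rng(0)

# Stand-in for generated data: balanced normal and abnormal samples.
X_syn, y_syn = sample(rng, 500, 500, shift=1.0)
# Stand-in for real data: extremely imbalanced (1% abnormal).
X_real, y_real = sample(rng, 495, 5, shift=1.0)

# Step 1: pre-train the base model on the synthetic data.
w0, b0 = train_logreg(X_syn, y_syn)
# Step 2: fine-tune on the real data, starting from the pre-trained weights
# and using a smaller learning rate and fewer epochs.
w1, b1 = train_logreg(X_real, y_real, w=w0, b=b0, lr=0.01, epochs=20)

pretrain_acc = ((sigmoid(X_syn @ w0 + b0) > 0.5) == y_syn).mean()
print("pre-training accuracy on synthetic data:", pretrain_acc)
```

The sketch only shows the order of the two training stages. In the setting the abstract describes, the synthetic samples would come from a trained generator and the base model would typically be a neural network fine-tuned through transfer learning.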