The Deep Radial Basis Function Data Descriptor (D-RBFDD) Network: A One-Class Neural Network for Anomaly Detection

Mehran Hossein Zadeh Bazargani,Arjun Pakrashi,Brian Mac Namee
DOI: https://doi.org/10.1109/access.2022.3187961
IF: 3.9
2022-07-15
IEEE Access
Abstract:Anomaly detection is a challenging problem in machine learning, and is made even more so when dealing with instances that are captured in low-level, raw data representations without a well-known and well-behaved set of engineered features. Images or data streams from sensors are good examples of such low-level, raw data representations. The Radial Basis Function Data Descriptor (RBFDD) network is an effective solution for anomaly detection, however, it is a shallow model that does not deal well with low-level, raw data representations. This article investigates approaches to transform an RBFDD network into a deep one-class classifier that works well for anomaly detection problems with low-level, raw data representations. We show that approaches based on simple transfer learning are not effective and our results suggest that this is because the latent representations learned by generic classification models are not suitable for anomaly detection. We show that an alternative approach that adds multiple convolutional layers before the Radial Basis Function (RBF) layer of an RBFDD network—–to form a Deep Radial Basis Function Data Descriptor (D-RBFDD) network—is very effective. This is demonstrated in a set of evaluation experiments based on multiple anomaly detection scenarios created from publicly available image classification datasets, and a real-world anomaly detection dataset in which different types of arrhythmia are detected in electrocardiogram (ECG) data. Our experiments show that the D-RBFDD network out-performs state-of-the-art anomaly detection methods including the Deep Support Vector Data Descriptor (Deep SVDD), One-Class Support Vector Machine (OCSVM), and Isolation Forest on the image datasets, and produces competitive results on the ECG dataset.
What problem does this paper attempt to address?