A Copula-Driven Unsupervised Learning Framework for Anomaly Detection with Multivariate Heterogeneous Data

Swaroop Damodaran, Ram Padmanabhan, R. Maahin, Sanjeev Gurugopinath
DOI: https://doi.org/10.1109/mlsp52302.2021.9596359
2021-10-25
Abstract: We consider the problem of anomaly detection with heterogeneous and correlated multivariate data, without assuming prior knowledge of the statistical correlation. First, we use a copula-based approach to measure the statistical correlation among the modalities. Next, we employ an unsupervised learning (UL) framework for anomaly detection, using data points sampled from the copula-based joint distribution. In particular, our study covers Gaussian, R-Vine, D-Vine and C-Vine copula techniques, combined with the isolation forest, one-class SVM, local outlier factor, elliptic envelope and autoencoder UL algorithms. Through Monte Carlo simulations and an experimental study on the IEEE Signal Processing Cup 2020 dataset, we show that the proposed framework significantly outperforms the direct training method in terms of detection accuracy. Furthermore, we show that the C-Vine-based autoencoder technique yields the best performance among the techniques considered, in terms of both the area under the receiver operating characteristic curve and the accuracy of detecting anomalies.
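The two-stage pipeline described in the abstract (fit a copula, sample from the resulting joint distribution, then train an unsupervised detector on the samples) can be illustrated with a minimal sketch. This is not the authors' implementation: it covers only the Gaussian copula with an isolation forest, and it assumes rank-based empirical marginals, a toy two-modality dataset, and hypothetical helper names (`fit_gaussian_copula`, `sample_gaussian_copula`), none of which come from the paper.

```python
import numpy as np
from scipy import stats
from sklearn.ensemble import IsolationForest


def fit_gaussian_copula(X):
    """Estimate the Gaussian-copula correlation matrix of X (n_samples, n_features).

    Assumption: marginals are handled empirically via ranks.
    """
    n = X.shape[0]
    U = stats.rankdata(X, axis=0) / (n + 1)   # pseudo-observations in (0, 1)
    Z = stats.norm.ppf(U)                     # map to standard normal scores
    return np.corrcoef(Z, rowvar=False)


def sample_gaussian_copula(X, corr, n_samples, rng):
    """Draw points whose dependence follows the fitted copula and whose
    marginals follow the empirical quantiles of X."""
    d = X.shape[1]
    Z = rng.multivariate_normal(np.zeros(d), corr, size=n_samples)
    U = stats.norm.cdf(Z)                     # back to uniform marginals
    cols = [np.quantile(X[:, j], U[:, j]) for j in range(d)]
    return np.column_stack(cols)


rng = np.random.default_rng(0)

# Toy heterogeneous data (an assumption for illustration): one Gaussian
# modality and one exponential modality, correlated through latent normals.
latent = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.8], [0.8, 1.0]], size=1000)
X = np.column_stack([latent[:, 0], stats.expon.ppf(stats.norm.cdf(latent[:, 1]))])

# Stage 1: measure the dependence and sample from the copula-based joint model.
corr = fit_gaussian_copula(X)
synthetic = sample_gaussian_copula(X, corr, n_samples=2000, rng=rng)

# Stage 2: train the unsupervised detector on the copula samples.
detector = IsolationForest(random_state=0).fit(synthetic)

# Score one typical point and one point far outside the training range.
queries = np.array([[0.0, 0.7], [10.0, 50.0]])
scores = detector.score_samples(queries)      # lower means more anomalous
labels = detector.predict(queries)            # +1 inlier, -1 outlier
```

The "direct training method" that the abstract uses as a baseline would correspond to fitting the detector on `X` itself rather than on `synthetic`. Replacing the Gaussian copula with an R-Vine, D-Vine or C-Vine model, or the isolation forest with one of the other listed detectors, changes only the corresponding stage of this sketch.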