Abstract:Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training steps. In this paper, our contributions are two-fold. On the one hand, we propose a flexible multi-channel framework to generate multi-type frame-level features. On the other hand, we study how it is possible to improve the detection performance by supervised learning. The multi-channel framework is based on four Conditional GANs (CGANs) taking various type of appearance and motion information as input and producing prediction information as output. These CGANs provide a better feature space to represent the distinction between normal and abnormal events. Then, the difference between those generative and ground-truth information is encoded by Peak Signal-to-Noise Ratio (PSNR). We propose to classify those features in a classical supervised scenario by building a small training set with some abnormal samples of the original test set of the dataset. The binary Support Vector Machine (SVM) is applied for frame-level anomaly detection. Finally, we use Mask R-CNN as detector to perform object-centric anomaly localization. Our solution is largely evaluated on Avenue, Ped1, Ped2, and ShanghaiTech datasets. Our experiment results demonstrate that PSNR features combined with supervised SVM are better than error maps computed by previous methods. We achieve state-of-the-art performance for frame-level AUC on Ped1 and ShanghaiTech. Especially, for the most challenging Shanghaitech dataset, a supervised training model outperforms up to 9% the state-of-the-art an unsupervised strategy.

Tam-Net: Temporal Enhanced Appearance-To-Motion Generative Network For Video Anomaly Detection

Anomaly Detection in Traffic Surveillance Videos with GAN-based Future Frame Prediction

TransGANomaly: Transformer based Generative Adversarial Network for Video Anomaly Detection

What goes around comes around: Cycle-Consistency-based Short-Term Motion Prediction for Anomaly Detection using Generative Adversarial Networks

Learning Appearance-motion Normality for Video Anomaly Detection.

Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection

Spatiotemporal consistency-enhanced network for video anomaly detection

Spatial-Temporal Graph Convolutional Network Boosted Flow-Frame Prediction for Video Anomaly Detection

Context-related video anomaly detection via generative adversarial network

Video Anomaly Detection using GAN

3D U-Net for Video Anomaly Detection.

Attention-guided generator with dual discriminator GAN for real-time video anomaly detection

Learning Appearance-Motion Synergy Via Memory-Guided Event Prediction for Video Anomaly Detection

Learning Attention Augmented Spatial-temporal Normality for Video Anomaly Detection

Detecting abnormality with separated foreground and background: Mutual Generative Adversarial Networks for video abnormal event detection

CVAD-GAN: Constrained video anomaly detection via generative adversarial network

Integrated Multiscale Appearance Features and Motion Information Prediction Network for Anomaly Detection

Appearance-Motion Memory Consistency Network for Video Anomaly Detection

Appearance-Motion united Auto-Encoder Framework for Video Anomaly Detection

Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos

Object-Guided and Motion-Refined Attention Network for Video Anomaly Detection