Abstract:Anomaly detection in videos is the task of identifying frames from a video sequence that depict events that do not conform to expected behavior, which is an extremely challenging task due to the ambiguous and unbounded properties of anomalies. With the development of deep learning, video anomaly detection methods based on deep neural networks have made great progress. The existing methods mainly follow two routes, namely, frame reconstruction and frame prediction. Due to the powerful generalization ability of neural networks, the application of reconstruction-based methods is limited. Recently, anomaly detection methods based on prediction have achieved advanced performance. However, their performance suffers when they cannot guarantee lower prediction errors for normal events. In this paper, we propose a novel future frame prediction model based on a bidirectional retrospective generation adversarial network (BR-GAN) for anomaly detection. To predict a future frame with higher quality for normal events, first, we propose a bidirectional prediction combined with a retrospective prediction method to fully mine the bidirectional temporal information between the predicted frame and the input frame sequence. Then, the intensity and gradient loss between the predicted frame and the actual frame together with an adversarial loss are used for appearance (spatial) constraints. In addition, we propose a sequence discriminator composed of a 3-dimensional (3D) convolutional neural network to capture the long-term temporal relationships between frame sequences composed of predicted frames and input frames; this network plays a crucial role in maintaining the motion (temporal) consistency of the predicted frames for normal events. Such appearance and motion constraints further facilitate future frame prediction for normal events, and thus, the prediction network can be highly capable of distinguishing normal and abnormal patterns. Extensive experiments on benchmark datasets demonstrate that our method outperforms most existing state-of-the-art methods, validating the effectiveness of our method for anomaly detection.

Recurrent Adversarial Video Prediction Network

Sequential Video VLAD: Training the Aggregation Locally and Temporally

Z-Order Recurrent Neural Networks For Video Prediction

Deep RNN Framework for Visual Sequential Applications

PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs

Long-Term Prediction of Natural Video Sequences with Robust Video Predictors

Self-supervised Generative Learning for Sequential Data Prediction

PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning

ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning

Structure Preserving Video Prediction

Bidirectional Retrospective Generation Adversarial Network for Anomaly Detection in Videos.

Video Anomaly Detection Using Dual Discriminator Based Generative Adversarial Network.

Scene-Dependent Prediction in Latent Space for Video Anomaly Detection and Anticipation

Patch Spatio-Temporal Relation Prediction for Video Anomaly Detection

Long Short-Term Dynamic Prototype Alignment Learning for Video Anomaly Detection

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Exploiting Spatial-temporal Correlations for Video Anomaly Detection

Folded Recurrent Neural Networks for Future Video Prediction

Long-Term Video Prediction Via Criticization and Retrospection

BEAVP: A Bidirectional Enhanced Adversarial Model for Video Prediction