Abstract: Anomaly detection in videos is the task of identifying frames in a video sequence that depict events that do not conform to expected behavior. The task is extremely challenging because anomalies are ambiguous and unbounded. With the development of deep learning, video anomaly detection methods based on deep neural networks have made great progress. Existing methods mainly follow two routes: frame reconstruction and frame prediction. Because neural networks generalize so well, they can also reconstruct abnormal frames with low error, which limits the applicability of reconstruction-based methods. Recently, prediction-based anomaly detection methods have achieved strong performance. However, their performance suffers when they cannot guarantee low prediction errors for normal events. In this paper, we propose a novel future frame prediction model for anomaly detection based on a bidirectional retrospective generation adversarial network (BR-GAN). To predict higher-quality future frames for normal events, we first propose bidirectional prediction combined with retrospective prediction to fully mine the bidirectional temporal information between the predicted frame and the input frame sequence. Then, the intensity and gradient losses between the predicted frame and the actual frame, together with an adversarial loss, are used as appearance (spatial) constraints. In addition, we propose a sequence discriminator built from a 3-dimensional (3D) convolutional neural network to capture the long-term temporal relationships within frame sequences composed of predicted frames and input frames; this network plays a crucial role in maintaining the motion (temporal) consistency of the predicted frames for normal events. These appearance and motion constraints further improve future frame prediction for normal events, making the prediction network better able to distinguish normal from abnormal patterns. Extensive experiments on benchmark datasets demonstrate that our method outperforms most existing state-of-the-art methods, validating its effectiveness for anomaly detection.
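
The appearance constraints mentioned in the abstract (intensity and gradient losses between the predicted and actual frame) can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so the code below assumes a formulation that is common in frame-prediction work: mean squared pixel error for intensity, and the mean absolute difference of absolute finite-difference gradients for the gradient term. The function names and the weights `w_int` and `w_grad` are hypothetical, and the adversarial and sequence-discriminator terms are omitted.

```python
import numpy as np


def intensity_loss(pred, target):
    """Mean squared pixel difference (assumed form of the intensity loss)."""
    return float(np.mean((pred - target) ** 2))


def gradient_loss(pred, target):
    """Mean absolute difference between absolute spatial gradients.

    Assumed form: gradients are first-order finite differences along the
    horizontal and vertical axes of a 2D (H x W) grayscale frame.
    """
    pred_dx = np.abs(pred[:, 1:] - pred[:, :-1])
    target_dx = np.abs(target[:, 1:] - target[:, :-1])
    pred_dy = np.abs(pred[1:, :] - pred[:-1, :])
    target_dy = np.abs(target[1:, :] - target[:-1, :])
    horizontal = np.mean(np.abs(pred_dx - target_dx))
    vertical = np.mean(np.abs(pred_dy - target_dy))
    return float(horizontal + vertical)


def appearance_loss(pred, target, w_int=1.0, w_grad=1.0):
    """Weighted sum of both terms; the weights are hypothetical placeholders."""
    return w_int * intensity_loss(pred, target) + w_grad * gradient_loss(pred, target)


if __name__ == "__main__":
    flat = np.zeros((4, 4))                    # stand-in for a predicted frame
    ramp = np.tile(np.arange(4.0), (4, 1))     # stand-in for the actual frame
    print(intensity_loss(flat, ramp), gradient_loss(flat, ramp))
```

Under these assumptions, the intensity term penalizes overall brightness errors while the gradient term penalizes blurred or misplaced edges, so a prediction can score well on one and poorly on the other.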

BEAVP: A Bidirectional Enhanced Adversarial Model for Video Prediction

Adaptive Hierarchical Motion-Focused Model for Video Prediction

Dual Motion GAN for Future-Flow Embedded Video Prediction

Bidirectional Transformer GAN for Long-term Human Motion Prediction

Probabilistic Video Prediction From Noisy Data With a Posterior Confidence

HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator

Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction

BiHMP-GAN: Bidirectional 3D Human Motion Prediction GAN

Video Frame Prediction with Dual-Stream Deep Network Emphasizing Motions and Content Details

State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend

Motion and Context-Aware Audio-Visual Conditioned Video Prediction

Jointly Predicting Future Sequence And Steering Angles For Dynamic Driving Scenes

Video Prediction Models as General Visual Encoders

Active Patterns Perceived for Stochastic Video Prediction

Generative Adversarial Network-Based Frame Extrapolation for Video Coding

Bidirectional Retrospective Generation Adversarial Network for Anomaly Detection in Videos

Efficient Human Motion Prediction Using Temporal Convolutional Generative Adversarial Network

Disentangling Propagation and Generation for Video Prediction

Video Prediction Via Selective Sampling

Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention

Video Anomaly Detection Using Dual Discriminator Based Generative Adversarial Network