Abstract: The limited domain generalization capability of contemporary video anomaly detection methods restricts their efficacy to specific datasets. To improve the generalizability and portability of video anomaly detection models, we propose a domain adaptation network framework with robust generalization performance. The framework aims to let a video anomaly detection model generalize from the source domain to an unseen target domain while mitigating the impact of missing labeled data on deep architectures. It incorporates a graph-based domain-invariant representation learning module and a domain discriminator. By calculating the strength of the relationships among domain nodes, these components enable the model to learn deep features that remain unchanged across domains. Notably, inspired by domain adversarial learning, the framework uses a gradient reversal layer that acts during backpropagation, so that the parameters of the feature mapping are driven in the direction opposite to the one that minimizes the domain loss. To address the domain generalization problem in video anomaly detection, the framework applies graph convolution techniques. It leverages a novel adjacency matrix that encourages high coherence within the same domain while optimizing the mapping of low-level deep features from the source to the target domains, which enhances the discriminative performance of the video anomaly detection model in the target domain. Simulation experiments were conducted on the Avenue, UCSD-Ped1, UCSD-Ped2, ShanghaiTech, UCF-Crime, and TAD datasets, and labeled data from the source domain were used during training. The results demonstrate that our framework enables models trained in one or more scenes (domains) to perform well in unknown scenes (domains), with good cross-domain testing AUC performance. For example, when training on multiple domains and testing on the Avenue dataset, our domain adversarial learning framework improves detection accuracy by 12.47%. Under severe single-domain generalization scenarios, the AUC performance on the target domain (e.g., the UCF-Crime dataset) increases by 4.36%, 8.64%, and 3.68%, respectively.
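
The gradient reversal mechanism referred to in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a scalar toy feature extractor (`feat = w * x`) and a logistic domain discriminator with a binary cross-entropy domain loss, and the names `GradReverse`, `domain_step`, `w`, `v`, and `lam` are hypothetical and chosen only for illustration. The layer is the identity in the forward pass and multiplies the incoming gradient by `-lam` in the backward pass, so the discriminator learns to separate domains while the feature extractor is pushed toward features that confuse it.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class GradReverse:
    """Identity in the forward pass; scales the gradient by -lam in the backward pass."""

    def __init__(self, lam=1.0):
        self.lam = lam  # reversal strength (illustrative hyperparameter)

    def forward(self, x):
        return x

    def backward(self, grad):
        return -self.lam * grad


def domain_step(w, v, x, domain_label, lam=1.0):
    """One manual forward/backward pass: extractor -> GRL -> discriminator.

    Returns (domain_loss, grad_w, grad_v) for the toy scalar model.
    """
    grl = GradReverse(lam)
    feat = w * x                          # toy feature extractor (assumption)
    p = sigmoid(v * grl.forward(feat))    # toy domain discriminator (assumption)
    loss = -(domain_label * math.log(p) + (1.0 - domain_label) * math.log(1.0 - p))

    dlogit = p - domain_label             # d(loss)/d(logit) for sigmoid + cross-entropy
    grad_v = dlogit * feat                # discriminator gradient is left unchanged
    grad_feat = grl.backward(dlogit * v)  # gradient is reversed before reaching the extractor
    grad_w = grad_feat * x
    return loss, grad_w, grad_v
```

In this sketch, a gradient descent step on `v` lowers the domain loss, whereas a gradient descent step on `w` raises it, which is the opposing-direction behavior that adversarial domain learning relies on. In an automatic differentiation framework the same effect would normally be obtained with a custom backward rule rather than hand-written gradients.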

Domain Generalization for Video Anomaly Detection Considering Diverse Anomaly Types

Graph-based domain adversarial learning framework for video anomaly detection domain generalization

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models

Configurable Spatial-Temporal Hierarchical Analysis for Flexible Video Anomaly Detection

Open-Vocabulary Video Anomaly Detection

Rethinking Prediction-Based Video Anomaly Detection from Local-Global Normality Perspective

Cross-Domain Video Anomaly Detection without Target Domain Adaptation

Cognition Guided Video Anomaly Detection Framework for Surveillance Services

VideoDG: Generalizing Temporal Relations in Videos to Novel Domains

Diversifying Spatial-Temporal Perception for Video Domain Generalization

Appearance Blur-driven AutoEncoder and Motion-guided Memory Module for Video Anomaly Detection

Video Anomaly Detection Based on Spatio-Temporal Relationships among Objects

Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection

Advancing Video Anomaly Detection: A Concise Review and a New Dataset

Multi-scale Spatial-temporal Interaction Network for Video Anomaly Detection

Deep Video Anomaly Detection: Opportunities and Challenges

Normality learning reinforcement for anomaly detection in surveillance videos

Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection

Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

Spatiotemporal consistency-enhanced network for video anomaly detection