Abstract:The limited domain generalization capability of contemporary video anomaly detection methods restricts their efficacy to specific datasets. To enhance the generalizability and portability of video anomaly detection models, we propose a domain adaptation network framework with robust generalization performance. The objective of the framework is to enable the video anomaly detection model to generalize from the source domain to the untrained target domain while mitigating the impact of missing labeled data on deep architectures. The framework incorporates a graph-based domain-invariant representation learning module and domain discriminator that enable the model to learn deep features with domain-invariant properties that remain unchanged across different domains by calculating the strength of the relationships among domain nodes. Notably, inspired by domain adversarial learning, the framework utilizes a gradient reversal layer acting on backpropagation that guides the parameters of optimal feature mapping in constructing the loss with opposing directions. To address the domain generalization problem in video anomaly detection, this framework applies graph convolution techniques. The framework leverages a novel adjacency matrix that encourages high coherence within the same domain while optimizing the mapping of low-level deep features from source to target domains to enhance the discriminative performance of the video anomaly detection model in the target domain. Simulation experiments were conducted on Avenue, UCSD-Ped1, UCSD-Ped2, ShanghaiTech, UCF-Crime, and TAD datasets, and labeled data from the source domain were utilized during the training process. Various testing results demonstrate that our framework enables models trained in one or more different scenes (domains) to perform well in unknown scenes (domains) with good cross-domain testing AUC performance. For example, in multidomain training generalization to the Avenue dataset for testing, our domain adversarial learning framework improves detection accuracy by 12.47%. Under severe single-domain generalization scenarios, the AUC performance on the target domain (e.g., UCF-Crime dataset) increase by 4.36%, 8.64%, and 3.68%, respectively.

Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation

Multi-View Domain Adaptive Object Detection on Camera Networks.

Deep Joint Two-Stream Wasserstein Auto-Encoder and Selective Attention Alignment for Unsupervised Domain Adaptation

AdvMix: Adversarial Mixing Strategy for Unsupervised Domain Adaptive Object Detection

Temporal Attentive Alignment for Large-Scale Video Domain Adaptation

Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation

Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding

Multi-source Distilling Domain Adaptation

Trust-aware Conditional Adversarial Domain Adaptation with Feature Norm Alignment.

Cross-domain video action recognition via adaptive gradual learning

Multiple Source Domain Adaptation with Adversarial Training of Neural Networks

Video domain adaptation for semantic segmentation using perceptual consistency matching

A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection

Dynamic Video Mix-Up for Cross-Domain Action Recognition

Multicomponent Adversarial Domain Adaptation: A General Framework.

Graph-based domain adversarial learning framework for video anomaly detection domain generalization

Multi-Source Domain Adaptation with Mixture of Joint Distributions

Adversarial Bipartite Graph Learning for Video Domain Adaptation

Temporally Coherent Video Harmonization Using Adversarial Networks

Confidence Attention and Generalization Enhanced Distillation for Continuous Video Domain Adaptation

DyMix: Dynamic Frequency Mixup Scheduler based Unsupervised Domain Adaptation for Enhancing Alzheimer's Disease Identification