Abstract:Video anomaly event detection is crucial for analyzing surveillance videos. Existing methods have limitations: frame-level detection fails to remove background interference, and object-level methods overlook object-environment interaction. To address these issues, this paper proposes a novel video anomaly event detection algorithm based on a dual-channel autoencoder with key region feature enhancement. The goal is to preserve valuable information in the global context while focusing on regions with a high anomaly occurrence. Firstly, a key region extraction network is proposed to perform foreground segmentation on video frames, eliminating background redundancy. Secondly, a dual-channel autoencoder is designed to enhance the features of key regions, enabling the model to extract more representative features. Finally, channel attention modules are inserted between each deconvolution layer of the decoder to enhance the model's perception and discrimination of valuable information. Compared to existing methods, our approach accurately locates and focuses on regions with a high anomaly occurrence, improving the accuracy of anomaly event detection. Extensive experiments are conducted on the UCSD ped2, CUHK Avenue, and SHTech Campus datasets, and the results validate the effectiveness of the proposed method.

What problem does this paper attempt to address?

This paper attempts to solve several key problems in video abnormal event detection: 1. **Background interference problem**: Existing frame - level detection methods cannot effectively remove background interference, resulting in the detection results being affected by background information and reducing the detection accuracy. 2. **Object - environment interaction neglect problem**: Existing object - level detection methods often neglect the interaction between objects and the environment, resulting in the inability to comprehensively capture the characteristics of abnormal events. 3. **Insufficient utilization of global and local information**: Most existing methods mainly utilize global or local information without fully considering the key regions with a high abnormal occurrence rate, which limits the detection performance of the model. To solve the above problems, this paper proposes a video abnormal event detection algorithm (KRFE - DAE) based on a dual - channel auto - encoder and key - region feature enhancement. Specifically, the main contributions of this method include: - **Key - Region Extraction Network (KREN)**: A key - region extraction network is designed to separate the foreground motion regions with a high abnormal occurrence rate from the video background and reduce the interference of background redundant information. - **Dual - channel auto - encoder structure**: A dual - channel auto - encoder structure is proposed, which can extract features from the global context and key regions, retain information that may trigger abnormalities, and enhance the features of key regions at the same time. - **Attention mechanism**: A channel - attention module is introduced at each layer of the decoder. By using the inter - channel correlation information, noise diffusion is suppressed, and the model pays more attention to the abnormal key regions, thereby increasing the reconstruction error of abnormal samples. - **Experimental verification**: Extensive experiments were carried out on three benchmark datasets, namely UCSD ped2, CUHK Avenue, and SHTech Campus, to verify the effectiveness and generalization ability of the proposed method. Through these improvements, this method can more accurately locate and focus on regions with a high abnormal occurrence rate under complex background conditions, thereby improving the accuracy of video abnormal event detection.

Dual-Channel Autoencoder with Key Region Feature Enhancement for Video Anomalous Event Detection

Attention-based residual autoencoder for video anomaly detection

Channel based approach via faster dual prediction network for video anomaly detection

Video anomaly detection based on a multi-layer reconstruction autoencoder with a variance attention strategy

Video Anomaly Detection Based on Attention Mechanism

Spatiotemporal Masked Autoencoder with Multi-Memory and Skip Connections for Video Anomaly Detection

Video Anomaly Detection Based on Spatio-Temporal Relationships among Objects

Appearance-Motion united Auto-Encoder Framework for Video Anomaly Detection

Dual contrast discriminator with sharing attention for video anomaly detection

Video Anomaly Detection Based on Global–Local Convolutional Autoencoder

Learning Appearance-motion Normality for Video Anomaly Detection.

Residual spatiotemporal autoencoder for unsupervised video anomaly detection

Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors

Dual GroupGAN: An unsupervised four-competitor (2V2) approach for video anomaly detection

An informative dual ForkNet for video anomaly detection

A Two-Branch Network for Video Anomaly Detection with Spatio-Temporal Feature Learning

Synthetic Pseudo Anomalies for Unsupervised Video Anomaly Detection: A Simple yet Efficient Framework based on Masked Autoencoder

Video anomaly detection and localization via multivariate gaussian fully convolution adversarial autoencoder

Research on Video Anomaly Detection Based on Cascaded Memory-augmented Autoencoder

Pedestrian Spatio-Temporal Information Fusion For Video Anomaly Detection

Memory Enhanced Spatial-Temporal Graph Convolutional Autoencoder for Human-Related Video Anomaly Detection.