Dual-Channel Autoencoder with Key Region Feature Enhancement for Video Anomalous Event Detection

Qing Ye,Zihan Song,Yuqi Zhao,Yongmei Zhang
DOI: https://doi.org/10.1007/s11063-024-11634-9
IF: 2.565
2024-05-29
Neural Processing Letters
Abstract:Video anomaly event detection is crucial for analyzing surveillance videos. Existing methods have limitations: frame-level detection fails to remove background interference, and object-level methods overlook object-environment interaction. To address these issues, this paper proposes a novel video anomaly event detection algorithm based on a dual-channel autoencoder with key region feature enhancement. The goal is to preserve valuable information in the global context while focusing on regions with a high anomaly occurrence. Firstly, a key region extraction network is proposed to perform foreground segmentation on video frames, eliminating background redundancy. Secondly, a dual-channel autoencoder is designed to enhance the features of key regions, enabling the model to extract more representative features. Finally, channel attention modules are inserted between each deconvolution layer of the decoder to enhance the model's perception and discrimination of valuable information. Compared to existing methods, our approach accurately locates and focuses on regions with a high anomaly occurrence, improving the accuracy of anomaly event detection. Extensive experiments are conducted on the UCSD ped2, CUHK Avenue, and SHTech Campus datasets, and the results validate the effectiveness of the proposed method.
computer science, artificial intelligence
What problem does this paper attempt to address?
This paper attempts to solve several key problems in video abnormal event detection: 1. **Background interference problem**: Existing frame - level detection methods cannot effectively remove background interference, resulting in the detection results being affected by background information and reducing the detection accuracy. 2. **Object - environment interaction neglect problem**: Existing object - level detection methods often neglect the interaction between objects and the environment, resulting in the inability to comprehensively capture the characteristics of abnormal events. 3. **Insufficient utilization of global and local information**: Most existing methods mainly utilize global or local information without fully considering the key regions with a high abnormal occurrence rate, which limits the detection performance of the model. To solve the above problems, this paper proposes a video abnormal event detection algorithm (KRFE - DAE) based on a dual - channel auto - encoder and key - region feature enhancement. Specifically, the main contributions of this method include: - **Key - Region Extraction Network (KREN)**: A key - region extraction network is designed to separate the foreground motion regions with a high abnormal occurrence rate from the video background and reduce the interference of background redundant information. - **Dual - channel auto - encoder structure**: A dual - channel auto - encoder structure is proposed, which can extract features from the global context and key regions, retain information that may trigger abnormalities, and enhance the features of key regions at the same time. - **Attention mechanism**: A channel - attention module is introduced at each layer of the decoder. By using the inter - channel correlation information, noise diffusion is suppressed, and the model pays more attention to the abnormal key regions, thereby increasing the reconstruction error of abnormal samples. - **Experimental verification**: Extensive experiments were carried out on three benchmark datasets, namely UCSD ped2, CUHK Avenue, and SHTech Campus, to verify the effectiveness and generalization ability of the proposed method. Through these improvements, this method can more accurately locate and focus on regions with a high abnormal occurrence rate under complex background conditions, thereby improving the accuracy of video abnormal event detection.