Abstract: Video anomaly detection remains a challenging task because the definition of an anomaly is fuzzy and the scenes in real video data are complex. In this work, we explore a novel lightweight dual-branch convolutional neural architecture that separates appearance and motion representations to capture spatial and temporal information, respectively, since abnormal events usually differ from normal cases in appearance or motion behavior. Considering the channel redundancy of traditional neural networks, and because the two branches process different feature information, we apply channel attenuation to each branch accordingly, which greatly improves the speed of anomaly detection while maintaining model performance. To improve the utilization of key features, we insert a Channel Squeeze and Excitation module into the encoder part of the network to model channel correlations and adaptively recalibrate the channel-wise feature responses. The importance of each feature channel is learned automatically; according to that importance, useful channels are promoted and channels that are not useful for the current task are suppressed. In addition, to increase the reconstruction error of the motion encoder and to account for the diversity of normal patterns, we augment the motion U-Net with a memory module whose items record the prototypical patterns of normal data. Experiments on three benchmark datasets, UCSD Ped2, CUHK Avenue, and ShanghaiTech, show that our method achieves AUC scores of 96.3%, 87.4%, and 73.5%, respectively. The detection speed reaches 55 fps, which is competitive with the current state of research.
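The channel recalibration described in the abstract can be illustrated with a minimal NumPy sketch of a generic squeeze-and-excitation step. This is not the authors' implementation: the function name `squeeze_excite`, the reduction ratio `r`, the tensor sizes, and the randomly initialized weights are all assumptions chosen for illustration, and a trained network would learn the weights instead.

```python
import numpy as np

def squeeze_excite(x, w1, b1, w2, b2):
    """Generic squeeze-and-excitation over a (C, H, W) feature map.

    w1 has shape (C // r, C) and w2 has shape (C, C // r), where r is a
    hypothetical reduction ratio (an assumption, not taken from the paper).
    """
    # Squeeze: global average pooling reduces each channel to one scalar.
    z = x.mean(axis=(1, 2))                        # shape (C,)
    # Excitation: bottleneck layer with ReLU, then a sigmoid gate per channel.
    h = np.maximum(w1 @ z + b1, 0.0)               # shape (C // r,)
    gates = 1.0 / (1.0 + np.exp(-(w2 @ h + b2)))   # shape (C,), values in (0, 1)
    # Recalibration: scale every channel by its gate value.
    return x * gates[:, None, None], gates

# Illustrative sizes and random weights (assumptions for the sketch only).
rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2
x = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
b1 = np.zeros(C // r)
w2 = rng.standard_normal((C, C // r)) * 0.1
b2 = np.zeros(C)

y, gates = squeeze_excite(x, w1, b1, w2, b2)
print(y.shape, gates.round(3))
```

Channels whose gate is close to 1 pass through nearly unchanged, while channels with a gate close to 0 are suppressed, which is the "promote useful channels, suppress the rest" behavior the abstract refers to.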

Object-Guided and Motion-Refined Attention Network for Video Anomaly Detection

Learning Attention Augmented Spatial-temporal Normality for Video Anomaly Detection

Video Anomaly Detection Based on Spatio-Temporal Relationships among Objects

Attention-Driven Loss for Anomaly Detection in Video Surveillance

Video Anomaly Detection Based on Attention Mechanism

Appearance-Motion Memory Consistency Network for Video Anomaly Detection

Contrastive Attention for Video Anomaly Detection

Attention-based anomaly detection in multi-view surveillance videos

Object-based video anomaly detection using multi-attention and adaptive velocity attribute representation learning

Influence-aware Attention Networks for Anomaly Detection in Surveillance Videos

Local Attention Sequence Model for Video Object Detection

AONet: Attention network with optional activation for unsupervised video anomaly detection

Video Anomaly Detection and Localization Based on an Adaptive Intra-Frame Classification Network

Channel based approach via faster dual prediction network for video anomaly detection

Robust Unsupervised Video Anomaly Detection by Multipath Frame Prediction

Cognition Guided Video Anomaly Detection Framework for Surveillance Services

Spatiotemporal consistency-enhanced network for video anomaly detection

Multi-Scale Temporal Relations and Segmented Channel Attention for Video Anomaly Detection

Attention-based residual autoencoder for video anomaly detection