Video Anomaly Detection Via Motion Completion Diffusion for Intelligent Surveillance System

Zhenhua Xue, Ronghuai Hu, Chao Huang, Zhenlin Wei
DOI: https://doi.org/10.1109/jsen.2024.3453437
IF: 4.3
2024-01-01
IEEE Sensors Journal
Abstract: Detecting abnormal behaviours in surveillance video is a crucial and challenging task in public and industrial manufacturing scenarios. Unlike conventional techniques that operate on raw video data from camera sensors, pose-based approaches use low-dimensional, highly structured skeleton features, which makes them robust to background disturbances and improves detection efficiency. Nevertheless, existing pose-based methods mainly rely on an encoder-decoder architecture for video anomaly detection, and their performance remains unsatisfactory because they do not sufficiently cover the variants of different motion patterns. To tackle these challenges, we propose a novel Motion Completion Diffusion Model (MCDiffusion) for anomaly detection using motion sequences extracted from camera sensor data. MCDiffusion is characterised by high-quality sample generation and robust pattern coverage. Specifically, the model conditions on observed motion to provide more accurate and controllable motion completion results. We train the diffusion model with motion sequence masking: starting from random noise, the model gradually generates the masked motion and thereby learns normal patterns. An anomaly is determined from the error between the masked motion and its generated completion. Additionally, we represent human pose as a hierarchical spatio-temporal graph to capture both the dynamic interactions among individuals and the pose structure within each individual. MCDiffusion achieves state-of-the-art performance on four widely used video anomaly detection (VAD) datasets, setting a new benchmark for online anomaly detection from video cameras.
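The scoring step described in the abstract (mask part of a motion sequence, complete it from the observed part, and measure the error on the masked frames) can be illustrated with a minimal sketch. Everything named here is an assumption for illustration: the `(frames, joints, 2)` array layout, the choice to mask the trailing frames, the mean squared error, and above all `placeholder_completer`, which simply repeats the last observed pose and stands in for a trained completion model. It is not the paper's diffusion model.

```python
import numpy as np


def mask_future_frames(num_frames, num_observed):
    """Boolean mask over frames: True marks frames hidden from the model.

    Masking the trailing frames is an assumption; the abstract does not
    say which frames are masked.
    """
    mask = np.zeros(num_frames, dtype=bool)
    mask[num_observed:] = True
    return mask


def placeholder_completer(observed_motion, mask):
    """Hypothetical stand-in for a trained motion completion model.

    Fills every masked frame with the last observed pose. A real system
    would instead run a conditional generative model here.
    """
    completed = observed_motion.copy()
    last_observed_index = np.flatnonzero(~mask)[-1]
    completed[mask] = observed_motion[last_observed_index]
    return completed


def anomaly_score(motion, mask, completer):
    """Mean squared error between true and completed masked frames.

    motion: array of shape (frames, joints, 2) with 2D joint coordinates.
    """
    # Zero out masked frames so the completer cannot see the ground truth.
    observed = np.where(mask[:, None, None], 0.0, motion)
    completed = completer(observed, mask)
    squared_error = (completed[mask] - motion[mask]) ** 2
    return float(squared_error.mean())


# Toy data: 6 frames, 3 joints, 2D coordinates.
mask = mask_future_frames(num_frames=6, num_observed=4)
still = np.ones((6, 3, 2))                       # pose never changes
moving = still * np.arange(6)[:, None, None]     # pose drifts every frame

still_score = anomaly_score(still, mask, placeholder_completer)
moving_score = anomaly_score(moving, mask, placeholder_completer)
```

With this stand-in, a sequence the completer can reproduce gets a low score and one it cannot gets a high score; a threshold on the score, which would have to be chosen on validation data, would then flag anomalies.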