Abstract:This paper presents an anomaly detection method that is based on a sparse coding inspired Deep Neural Networks (DNN). Specifically, in light of the success of sparse coding based anomaly detection, we propose a Temporally-coherent Sparse Coding (TSC), where a temporally-coherent term is used to preserve the similarity between two similar frames. The optimization of sparse coefficients in TSC with the Sequential Iterative Soft-Thresholding Algorithm (SIATA) is equivalent to a special stacked Recurrent Neural Networks (sRNN) architecture. Further, to reduce the computational cost in alternatively updating the dictionary and sparse coefficients in TSC optimization and to alleviate hyperparameters selection in TSC, we stack one more layer on top of the TSC-inspired sRNN to reconstruct the inputs, and arrive at an sRNN-AE. We further improve sRNN-AE in the following aspects: i) rather than using a predefined similarity measurement between two frames, we propose to learn a data-dependent similarity measurement between neighboring frames in sRNN-AE to make it more suitable for anomaly detection; ii) to reduce computational costs in the inference stage, we reduce the depth of the sRNN in sRNN-AE and, consequently, our framework achieves real-time anomaly detection; iii) to improve computational efficiency, we conduct temporal pooling over the appearance features of several consecutive frames for summarizing information temporally, then we feed appearance features and temporally summarized features into a separate sRNN-AE for more robust anomaly detection. To facilitate anomaly detection evaluation, we also build a large-scale anomaly detection dataset which is even larger than the summation of all existing datasets for anomaly detection in terms of both the volume of data and the diversity of scenes. Extensive experiments on both a toy dataset under controlled settings and real datasets demonstrate that our method significantly outperforms existing -ethods, which validates the effectiveness of our sRNN-AE method for anomaly detection. Codes and data have been released at https://github.com/StevenLiuWen/sRNN_TSC_Anomaly_Detection.

Fusing Crops Representation into Snippet Via Mutual Learning for Weakly Supervised Surveillance Anomaly Detection

Learning Discrimination from Contaminated Data: Multi-Instance Learning for Unsupervised Anomaly Detection

Multimodal and multiscale feature fusion for weakly supervised video anomaly detection

A Lightweight Video Anomaly Detection Model with Weak Supervision and Adaptive Instance Selection

Anomalies cannot materialize or vanish out of thin air: A hierarchical multiple instance learning with position-scale awareness for video anomaly detection

Learning Prompt-Enhanced Context Features for Weakly-Supervised Video Anomaly Detection

Learning to Detect Anomalies in Surveillance Video.

MTFL: Multi-Timescale Feature Learning for Weakly-Supervised Anomaly Detection in Surveillance Videos

Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning

Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Decoupled appearance and motion learning for efficient anomaly detection in surveillance video

Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

Real-world Video Anomaly Detection by Extracting Salient Features in Videos

Real-world Anomaly Detection in Surveillance Videos

Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection

Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks

Dy-MIL: dynamic multiple-instance learning framework for video anomaly detection

Deep learning based anomaly detection in real-time video

A MIL Approach for Anomaly Detection in Surveillance Videos from Multiple Camera Views

Multi-Scale Video Anomaly Detection by Multi-Grained Spatio-Temporal Representation Learning

Integrated Multiscale Appearance Features and Motion Information Prediction Network for Anomaly Detection