Weakly supervised anomaly detection with multi-level contextual modeling

Mengting Liu, Xinrui Li, Yongge Liu, Yahong Han
DOI: https://doi.org/10.1007/s00530-023-01093-y
IF: 3.9
2023-01-01
Multimedia Systems
Abstract: Because abnormal behaviors occur rarely in the real world and are difficult to define, anomaly detection in surveillance video is very challenging. Weakly supervised video anomaly detection has recently been formulated as a multiple instance learning task. Although current methods detect anomalies effectively and alleviate the data imbalance caused by the scarcity of abnormal behaviors, they still struggle to distinguish abnormal behaviors that resemble normal ones. In addition, commonly used methods for weakly supervised video anomaly detection sometimes overlook the temporal factor: for example, the segments surrounding the most anomalous video segment are also more likely to be abnormal. To alleviate these issues, we propose a cascaded multi-level contextual content analysis module (CMC), which adapts a temporal-aware graph convolutional network and a non-local neural network to aggregate the contextual features of local and non-local video clips. CMC enlarges the distance between hard positive abnormal instances and normal instances and further strengthens the expression of multiple instance features. We evaluate our method on two benchmark datasets and conduct extensive ablation studies. The performance improvement demonstrates the effectiveness of our method.