Two‐stage video anomaly detection based on dual‐stream networks and multi‐instance learning
Dejun Zhang,Wenbo Fang,Yuhang Liu,Zirong Lyu,Chen Xiong,Zhan Wang
DOI: https://doi.org/10.1049/ipr2.13286
IF: 2.3
2024-12-03
IET Image Processing
Abstract:A two‐stage segment‐level video anomaly detection framework is proposed; specifically, in the first stage, an improved I3D network is used as a feature extractor to capture spatiotemporal features from the input video. In the second stage, a multiple instance learning method is introduced, where the averaged spatiotemporal features output by the I3D network are fed into a segment‐level anomaly classifier to construct an anomaly detection model using a deep multiple instance ranking framework. To promptly detect abnormal events in surveillance videos, this article designs a video anomaly detection method based on multiple instance learning. Generally, abnormal events occur less frequently compared to normal events. Traditional video surveillance relies on manual operation to monitor scenes and detect abnormal events by watching surveillance videos. However, watching surveillance footage is a labor‐intensive task, and prolonged observation can lead to visual fatigue and lack of concentration, which in turn results in missed detections and false positives [1]. Therefore, it is crucial to develop intelligent algorithms for video anomaly detection. The method can detect whether segments of a video contain abnormal events. First, the I3D network is used as a feature extractor to capture spatiotemporal features from the input video. Then, the spatiotemporal information is processed and input into a segment‐level anomaly detector based on multiple instance learning for detection. The authors treat abnormal videos as positive bags and normal videos as negative bags, and automatically learn a deep anomaly ranking model that can predict abnormal segments. Finally, the results of the training were tested and analyzed, demonstrating that the model is capable of detecting abnormal traffic segments.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology