Abstract:A two‐stage segment‐level video anomaly detection framework is proposed; specifically, in the first stage, an improved I3D network is used as a feature extractor to capture spatiotemporal features from the input video. In the second stage, a multiple instance learning method is introduced, where the averaged spatiotemporal features output by the I3D network are fed into a segment‐level anomaly classifier to construct an anomaly detection model using a deep multiple instance ranking framework. To promptly detect abnormal events in surveillance videos, this article designs a video anomaly detection method based on multiple instance learning. Generally, abnormal events occur less frequently compared to normal events. Traditional video surveillance relies on manual operation to monitor scenes and detect abnormal events by watching surveillance videos. However, watching surveillance footage is a labor‐intensive task, and prolonged observation can lead to visual fatigue and lack of concentration, which in turn results in missed detections and false positives [1]. Therefore, it is crucial to develop intelligent algorithms for video anomaly detection. The method can detect whether segments of a video contain abnormal events. First, the I3D network is used as a feature extractor to capture spatiotemporal features from the input video. Then, the spatiotemporal information is processed and input into a segment‐level anomaly detector based on multiple instance learning for detection. The authors treat abnormal videos as positive bags and normal videos as negative bags, and automatically learn a deep anomaly ranking model that can predict abnormal segments. Finally, the results of the training were tested and analyzed, demonstrating that the model is capable of detecting abnormal traffic segments.

Video event detection algorithm based on multi-scale instance learning

Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection

Abnormal event detection via multi-instance dictionary learning

A generic framework for event detection in various video domains.

Multi-Instance Dictionary Learning For Detecting Abnormal Events In Surveillance Videos

Human Detection Method Based on Multi-Part Detector and Multi-Instance Learning

Multi-scale Harmonic Mean Time Surfaces for Event-based Object Classification

Complex Video Event Detection Via Pairwise Fusion of Trajectory and Multi-Label Hypergraphs

Integrated Multi-Scale Event Verification In An Augmented Foreground Motion Space

Two‐stage video anomaly detection based on dual‐stream networks and multi‐instance learning

Multi-Scale Video Anomaly Detection by Multi-Grained Spatio-Temporal Representation Learning

Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search

Discovering Latent Discriminative Patterns for Multi-Mode Event Representation.

Coarse-to-Fine Video Instance Segmentation With Factorized Conditional Appearance Flows

Hidden Markov Model Based Events Detection In Soccer Video

Video Instance Segmentation by Instance Flow Assembly

Multiscale event detection in social media

Detecting events and key actors in multi-person videos

A Discriminative CNN Video Representation for Event Detection.

Effective video event detection via subspace projection

Complex Event Detection by Identifying Reliable Shots from Untrimmed Videos