Jaywalking detection and localization in street scene videos using fine-tuned convolutional neural networks

Aarti Bala,Rishabh Kaushal
DOI: https://doi.org/10.1007/s11042-023-14922-z
IF: 2.577
2023-03-15
Multimedia Tools and Applications
Abstract:Video Anomaly detection has been the focus of many research studies for a long time and shows potential for endless implementations to detect real-world anomalies. When it comes to road safety, Jaywalking is one such real-life event that can be detected and localized using video anomaly detection techniques. However, this field's progress depends majorly on the availability of diverse datasets and the kind of anomalies that they depict. The main objective of this work is to propose a video anomaly detection model for jaywalking that can help control the number of road accidents that occur due to jaywalking each year. The proposed model is a novel variation of the InceptionV3 deep CNN model and has been experimenting with one of the latest street scene datasets and its different variations. Our model consists of two separate subsystems based on pre-trained InceptionV3 architecture. The first subsystem, Anomaly Detector, takes a video frame as input and predicts whether it consists of a jaywalking event. The second subsystem, Anomaly Localizer, takes a jaywalking(anomalous) video frame as input and predicts the bounding box labels to locate the Jaywalking object in the frame. To evaluate, we employ the Street Scene Dataset, released in February 2020 that offers several kinds of jaywalking events. In this work, we experiment with different pre-trained CNN models, namely VGG16, ResNet50, and InceptionV3, to evaluate their performance for anomaly detection. Further, we assess the model's performance for different dataset sizes, having both an even and an uneven number of anomalous and non-anomalous frames. Lastly, we measure the effectiveness of the model on different types of jaywalking episodes. The evaluation shows that our proposed model attains remarkable detection accuracy, in comparison to other state-of-the-art methods proposed previously in this field.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?