An ensemble based approach for violence detection in videos using deep transfer learning

Gurmeet Kaur, Sarbjeet Singh
DOI: https://doi.org/10.1007/s11042-024-19388-1
IF: 2.577
2024-05-21
Multimedia Tools and Applications
Abstract: The detection of violence in videos has become an extremely valuable real-life application that aims to maintain and protect people's safety. Videos are inherently complex and violent actions are abrupt; although several approaches have been proposed, consistent performance remains elusive, especially on advanced real-life datasets. To address this, the paper proposes a Bagging ensemble-based approach comprising three pretrained models integrated with stacked Long Short-Term Memory (LSTM) layers to improve on the performance of the individual models. The ensemble is evaluated on two publicly accessible datasets, RLVS and RWF-2000, achieving accuracies of 96.6% and 92.7% and F1-scores of 96.6% and 93.0%, respectively. Additionally, a cross-dataset analysis demonstrates the model's ability to generalize across diverse datasets. Furthermore, an ablation study highlights the efficacy and optimal selection of the components that contribute to the proposed ensemble's efficiency.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering
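
The abstract names a Bagging ensemble of three base models but does not state how their outputs are combined. As a minimal sketch of the general bagging idea only, the Python below draws a bootstrap sample (the resampling step each base model would train on) and combines three per-clip violence probabilities by averaging. The function names, the averaging rule, the 0.5 threshold, and the probability values are all illustrative assumptions, not details taken from the paper.

```python
import random
from statistics import mean


def bootstrap_sample(items, rng):
    """Draw len(items) elements with replacement (the resampling step of bagging)."""
    return [rng.choice(items) for _ in items]


def soft_vote(probs_per_model, threshold=0.5):
    """Average the per-model violence probabilities and threshold the mean.

    Averaging is an assumed aggregation rule; the paper may use another.
    Returns (mean probability, predicted label), where label 1 means "violent".
    """
    avg = mean(probs_per_model)
    return avg, int(avg >= threshold)


# Hypothetical clip identifiers; each base model would train on its own resample.
clips = ["clip_a", "clip_b", "clip_c", "clip_d"]
resampled = bootstrap_sample(clips, random.Random(0))

# Hypothetical probabilities for one clip from three base models.
avg_prob, label = soft_vote([0.9, 0.6, 0.3])
```

In a full pipeline, each probability would come from a pretrained feature extractor followed by stacked LSTM layers, as the abstract describes; here they are plain numbers so the aggregation step can be read in isolation.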