BGRU-MTRA: bilinear GRU networks with multi-path temporal residual attention for suspicious activity recognition

Ajeet Pandey,Piyush Kumar
DOI: https://doi.org/10.1007/s00521-024-10416-7
2024-11-25
Neural Computing and Applications
Abstract:Suspicious activity recognition (SAR) is an active research field in computer vision and image processing due to the rapid demand for intelligent video surveillance systems. However, current automated systems focus too much on the temporal dynamics of events in videos and ignore the importance of spatial dynamics. The existing methods for SAR are complex and demand substantial resources. Hence, this paper proposes a novel trade-off architecture for SAR integrating the strengths of the multi-path temporal residual attention (MTRA) module with the bilinear-gated recurrent unit (BGRU) module. The MTRA module combines spatial feature extraction (SFE) with temporal residual attention network (TRAN) blocks for improving SAR by enhancing resilience to variations in object sizes, viewpoints, and motion patterns. It selects relevant action features, addresses vanishing gradient issues, and reduces spatial dimension. The BGRU module preserves spatial features, improving the model's ability to recognize fine-grained features and complex temporal patterns for effectively recognizing suspicious activities. The BGRU-MTRA system yields recognition accuracy of 93.28%, 98.97%, 99.86%, and 99.66%, and 48.42% on the benchmark hybrid crime action (HCA), UT interaction (UTI), CAVIER, hockey fight (HF), and the most challenging UCF-crime datasets, respectively. Through comprehensive experiments, it is clear that the proposed method outperforms state-of-the-art methods over benchmark datasets with reduced parameters.
computer science, artificial intelligence
What problem does this paper attempt to address?