Deep Learning-Based Human Action Recognition in Videos

Song Li,Qian Shi
DOI: https://doi.org/10.1142/s0218126625500409
2024-09-28
Journal of Circuits Systems and Computers
Abstract:Journal of Circuits, Systems and Computers, Ahead of Print. In order to solve the problem of low accuracy and efficiency in video human behavior recognition algorithm, a deep learning video human behavior recognition algorithm is proposed, which is based on an improved time division network. This method innovates on the classical two-stream convolutional neural network framework, and the core is to enhance the performance of the time division network by implementing the sliding window sampling technique with multiple time scales. This sampling strategy not only effectively integrates the full time-series information of the video, but also accurately captures the long-term dependencies hidden in human behavior, which further improves the accuracy and efficiency of behavior recognition. Experimental results show that the method proposed in this paper has achieved good advantages in multiple data sets. On HMDB51, our method achieves 84% recognition accuracy, while on the more complex Kinetics and UCF101 datasets, it also achieves 94% and significant recognition results, respectively. In the face of complex scenes and changeable human body structure, the proposed algorithm shows excellent robustness and stability. In terms of real-time, it can meet the high requirements of real-time video processing. Through the validation of experimental data, our method has made significant progress in extracting spatiotemporal features, capturing long-term dependencies, and focusing on key information.
engineering, electrical & electronic,computer science, hardware & architecture
What problem does this paper attempt to address?