Dual-module spatial temporal information enhancement graph convolutional network for recognizing traffic police command gestures

Peicheng Shi, Qing Zhang, Aixi Yang
DOI: https://doi.org/10.1007/s11760-024-03729-6
IF: 1.583
2024-12-09
Signal Image and Video Processing
Abstract: Rapid and accurate recognition of traffic police hand gestures is important for intelligent vehicles and smart transportation. However, existing algorithms struggle to finely distinguish traffic police gestures in dense crowds, and their recognition speed often fails to meet practical demands. To address this, we propose a traffic police gesture recognition method based on a dual-module spatial temporal information enhancement graph convolutional network (STIE-GCN). The method introduces Traffic Police Target Detection and Pose Skeleton Extraction (TD-PSE) to eliminate interference from complex environments on gesture recognition. We then incorporate a Synergy Attention Module (SAM) and a Keyframe Extraction Module (KEM) into the spatial temporal graph convolutional network to enhance its ability to extract synergistic action features and key action frames. The method is evaluated on three different datasets, and the experimental results show that it achieves an accuracy of 98.63% in traffic police gesture recognition, with an average model response time of 1.036 s. These results highlight the method's precision and efficiency in real-world applications.
engineering, electrical & electronic; imaging science & photographic technology