A wearable sensor-based dynamic gesture recognition model via broad attention learning

Yanhong Liu,Xingyu Li,Lei Yang
DOI: https://doi.org/10.1007/s11760-024-03567-6
IF: 1.583
2024-12-04
Signal Image and Video Processing
Abstract:Gesture recognition, being a natural and convenient means of human-computer interaction, has emerged as a prominent area of research. The goal of accurate and swift gesture recognition has driven numerous studies, as it plays a crucial role in meeting practical application demands. Real-time monitoring of surface electromyography (sEMG) signals collected from sEMG sensor during gesture execution enables the extraction of rich gesture action features, prompting researchers to delve into sEMG-based gesture recognition algorithms. However, existing algorithms often struggle to strike a favorable balance between prediction accuracy and computational efficiency. In response to these challenges, this paper proposes a new gesture recognition model leveraging multi-channel sEMG signals captured by a sEMG sensor. The approach involves employing a lightweight convolutional neural network (CNN) model as the backbone network, supplemented by a lightweight feature extraction block (LFE) to effectively aggregate multi-scale features. Additionally, the model benefits from pre-training weights based on transfer learning, enhancing feature extraction capabilities and generalization ability for faster convergence of the prediction model. Moreover, the introduction of a plug-and-play block called the broad attention learning block (BAL) significantly improves the overall recognition performance of the proposed model. To better capture contextual information from multi-channel sEMG signals, a pyramid input with varying resolutions is constructed, elevating the model's multi-scale feature extraction capability. The proposed model is evaluated and validated on a dynamic gesture dataset collected using wearable sensor (Myo). A series of ablation experiments, comparative analyses, and visualization outcomes unequivocally showcase the remarkable efficacy and superiority of our proposed method. Comparisons with other advanced models reveal that the proposed model successfully achieves a balanced compromise between prediction accuracy and processing speed.
engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?