A Bidirectional Siamese Recurrent Neural Network for Accurate Gait Recognition Using Body Landmarks

Proma Hossain Progga, Md. Jobayer Rahman, Swapnil Biswas, Md. Shakil Ahmed, Arif Reza Anwary, Swakkhar Shatabda
DOI: https://doi.org/10.1016/j.neucom.2024.128313
2024-12-05
Abstract: Gait recognition is a significant biometric technique for person identification, particularly in scenarios where other physiological biometrics are impractical or ineffective. In this paper, we address the challenges associated with gait recognition and present a novel approach to improve its accuracy and reliability. The proposed method leverages advanced techniques, including sequential gait landmarks obtained through the Mediapipe pose estimation model, Procrustes analysis for alignment, and a Siamese biGRU-dualStack Neural Network architecture for capturing temporal dependencies. Extensive experiments were conducted on large-scale cross-view datasets to demonstrate the effectiveness of the approach, achieving high recognition accuracy compared to other models. The model demonstrated accuracies of 95.7%, 94.44%, 87.71%, and 86.6% on CASIA-B, SZU RGB-D, OU-MVLP, and Gait3D datasets respectively. The results highlight the potential applications of the proposed method in various practical domains, indicating its significant contribution to the field of gait recognition.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
This paper addresses several key problems in gait recognition:

1. **Improve the accuracy and reliability of gait recognition**: Traditional gait recognition methods perform poorly under varying viewing angles, clothing changes, lighting conditions, and walking surfaces. The paper proposes a new method intended to overcome these problems and improve both accuracy and robustness.
2. **Capture spatio-temporal dependencies**: Gait recognition needs to consider spatial and temporal features together. Existing methods either ignore important temporal information (space-based methods) or fail to capture the dynamic differences between frames (time-based methods). The paper introduces a Siamese BiGRU-dualStack Neural Network to better capture the spatio-temporal dependencies of gait.
3. **Deal with viewing angle changes**: Gait data varies considerably across viewing angles. The paper uses Procrustes analysis to align gait frames, which reduces the impact of the viewing angle and improves recognition accuracy.
4. **Utilize advanced pose estimation models**: To cover the gait cycle more comprehensively, the paper adopts the Mediapipe pose estimation model to obtain continuous gait key points, which represent an individual's walking pattern more accurately.

### Specific solutions

- **Pose estimation**: Use the Mediapipe pose estimation model to extract gait key points.
- **Alignment processing**: Align gait frames through Procrustes analysis to reduce the impact of viewing angle changes.
- **Network architecture**: Design a Siamese BiGRU-dualStack neural network architecture to capture spatio-temporal dependencies in gait sequences, and train it with a contrastive loss function.
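
The alignment step can be illustrated with a minimal sketch. It assumes each frame is a list of 2D `(x, y)` landmarks in a fixed order; the summary does not state the landmark dimensionality or the exact Procrustes variant the authors use, so this is one common form (remove translation, normalize scale, then apply the best-fit rotation), not the paper's implementation. The function names `center_and_scale` and `procrustes_align` are hypothetical.

```python
import math


def center_and_scale(points):
    """Move the centroid to the origin and scale the shape to unit norm."""
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    centered = [(x - cx, y - cy) for x, y in points]
    norm = math.sqrt(sum(x * x + y * y for x, y in centered))
    if norm == 0.0:
        raise ValueError("all landmarks coincide; the shape cannot be normalized")
    return [(x / norm, y / norm) for x, y in centered]


def procrustes_align(frame, reference):
    """Align `frame` to `reference` (assumed 2D landmarks, same order).

    Returns the aligned landmarks and the residual sum of squared
    distances to the normalized reference (the disparity).
    """
    if len(frame) != len(reference):
        raise ValueError("frame and reference need the same number of landmarks")
    src = center_and_scale(frame)
    ref = center_and_scale(reference)

    # Closed-form least-squares rotation angle for 2D point sets.
    num = sum(x * v - y * u for (x, y), (u, v) in zip(src, ref))
    den = sum(x * u + y * v for (x, y), (u, v) in zip(src, ref))
    theta = math.atan2(num, den)
    c, s = math.cos(theta), math.sin(theta)

    aligned = [(c * x - s * y, s * x + c * y) for x, y in src]
    disparity = sum(
        (ax - u) ** 2 + (ay - v) ** 2 for (ax, ay), (u, v) in zip(aligned, ref)
    )
    return aligned, disparity
```

In a pipeline like the one described, every frame of a sequence would presumably be aligned to a common reference pose before the sequence is passed to the network; a frame that differs from the reference only by shift, scale, and in-plane rotation yields a disparity close to zero.
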
### Experimental verification

The method was evaluated on multiple large-scale cross-view datasets: CASIA-B, SZU RGB-D, OU-MVLP, and Gait3D. It achieved significant improvements in recognition accuracy, reaching 95.7%, 94.44%, 87.71%, and 86.6% respectively.

### Main contributions

- Propose a new gait recognition method that can accurately distinguish an individual's unique walking pattern.
- Utilize the Mediapipe pose estimation model to obtain continuous gait key points, ensuring comprehensive coverage of the gait cycle.
- Align gait key points through Procrustes analysis, reducing the impact of viewing angle changes.
- Design and apply a Siamese BiGRU-dualStack neural network architecture to effectively capture spatio-temporal dependencies in gait sequences.

These improvements make the method well suited to practical applications, especially access control, covert video surveillance, criminal investigation, and forensic analysis.
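
The Siamese architecture is trained with a contrastive loss, which can be sketched for a single pair of embeddings. The summary does not give the exact formulation, so the sketch below assumes the widely used margin-based form: pairs from the same person are penalized by their squared distance, and pairs from different people are penalized only when they fall inside the margin. The names `euclidean_distance` and `contrastive_loss` and the default `margin=1.0` are illustrative assumptions, not values from the paper.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(emb_a, emb_b, same_person, margin=1.0):
    """Margin-based contrastive loss for one pair (assumed formulation).

    same_person=True  -> loss is d^2, pulling the pair together.
    same_person=False -> loss is max(margin - d, 0)^2, pushing the pair
                         apart until it is at least `margin` away.
    """
    d = euclidean_distance(emb_a, emb_b)
    if same_person:
        return d ** 2
    return max(margin - d, 0.0) ** 2
```

Under this form, a pair of different people whose embeddings are already farther apart than the margin contributes zero loss, so training effort concentrates on pairs the network still confuses.
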