RBI-2RCNN: Residual Block Intensity Feature using a Two-stage Residual Convolutional Neural Network for Static Hand Gesture Recognition

Jaya Prakash Sahoo,Suraj Prakash Sahoo,Samit Ari,Sarat Kumar Patra
DOI: https://doi.org/10.1007/s11760-022-02163-w
2022-02-18
Abstract:Hand gesture recognition (HGR) is the most effective and intuitive way for the human–computer interface used in various applications, such as sign language recognition, robotics, and multimedia applications. The performance of the existing handcrafted techniques relies on the less generalized preprocessing and feature extraction steps. The convolutional neural networks (CNNs) can handle the less generalization characteristic of the handcrafted techniques. However, the architecture of CNN is complex and the extracted deep feature from these networks provides the global information only. Therefore, a CNN-based HGR paradigm can be developed with less number of layers and feature fusion with global and local information from different layers. Motivated by the above facts, this work proposes (i) a two-stage residual CNN (2RCNN) architecture for learning of features from the color hand gesture images which overcomes the need of a specific preprocessing step, (ii) a novel residual block intensity (RBI) feature to extract the global and local information from the hand gesture images. After extracting the RBI features, a linear kernel-based multi-class support vector machine classifier is used to recognize the gesture poses. The experimental results are evaluated using a subject-independent cross-validation test on three benchmarked datasets and compared with the earlier reported techniques.
What problem does this paper attempt to address?