Abstract: Sign language recognition technology can help people with hearing impairments communicate with hearing people. Deep learning now provides technical support for sign language recognition work. In sign language recognition tasks, traditional convolutional neural networks used to extract spatio-temporal features from sign language videos suffer from insufficient feature extraction, resulting in low recognition rates. In addition, large video-based sign language datasets require a significant amount of computing resources for training while ensuring the generalization of the network, which poses a further challenge for recognition. In this paper, we present a video-based sign language recognition method based on Residual Network (ResNet) and Long Short-Term Memory (LSTM). We use the ResNet convolutional network as the backbone model. As the number of network layers increases, ResNet can effectively address the gradient explosion problem and obtain better time series features. LSTM uses gates to control the cell state and update the output feature values of sequences. ResNet first extracts the sign language features from the video; the learned feature space is then used as the input of the LSTM network to obtain long sequence features. The combined network can effectively extract the spatio-temporal features in sign language videos and improve the recognition rate of sign language actions. An extensive experimental evaluation demonstrates the effectiveness and superior performance of the proposed method, with an accuracy of 85.26%, an F1-score of 84.98%, and a precision of 87.77% on the Argentine Sign Language dataset (LSA64).
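The data flow described in the abstract (per-frame features from a residual network, then gated sequence modeling with an LSTM) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: a small dense residual block stands in for the convolutional ResNet backbone, the weights are random and untrained, and the names `residual_block`, `lstm_step`, `classify_video`, and `init_params`, the frame dimension, and the hidden size are all assumptions chosen for illustration. The 64 output classes are chosen to match the number of signs in LSA64.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def residual_block(x, w1, w2):
    """Skip connection: the input x is added back to the learned residual F(x)."""
    h = np.maximum(0.0, x @ w1)          # F(x), first layer with ReLU
    return np.maximum(0.0, x + h @ w2)   # relu(x + F(x))

def lstm_step(x, h, c, W, U, b):
    """One LSTM time step; W, U, b stack the input, forget, candidate, output parameters."""
    z = x @ W + h @ U + b                # shape (4 * hidden,)
    i, f, g, o = np.split(z, 4)
    i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)   # gates in (0, 1)
    g = np.tanh(g)                       # candidate cell values
    c_new = f * c + i * g                # forget part of the old state, write new content
    h_new = o * np.tanh(c_new)           # output gate controls the exposed state
    return h_new, c_new

def init_params(dim, hidden, num_classes, scale=0.1):
    """Random, untrained parameters (illustration only)."""
    return {
        "hidden": hidden,
        "w1": rng.normal(0.0, scale, (dim, dim)),
        "w2": rng.normal(0.0, scale, (dim, dim)),
        "W": rng.normal(0.0, scale, (dim, 4 * hidden)),
        "U": rng.normal(0.0, scale, (hidden, 4 * hidden)),
        "b": np.zeros(4 * hidden),
        "V": rng.normal(0.0, scale, (hidden, num_classes)),
    }

def classify_video(frames, params):
    """frames: (T, dim) array, one hypothetical pre-flattened feature vector per frame."""
    h = np.zeros(params["hidden"])
    c = np.zeros(params["hidden"])
    for x in frames:
        feat = residual_block(x, params["w1"], params["w2"])   # spatial features
        h, c = lstm_step(feat, h, c, params["W"], params["U"], params["b"])  # temporal
    logits = h @ params["V"]             # classify from the final hidden state
    e = np.exp(logits - logits.max())    # numerically stable softmax
    return e / e.sum()

params = init_params(dim=32, hidden=16, num_classes=64)
frames = rng.normal(size=(20, 32))       # a fake 20-frame clip
probs = classify_video(frames, params)
```

Classifying from the final hidden state is one common design choice; averaging the hidden states over all frames would be an equally plausible alternative, and the abstract does not say which the authors use.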

Latent Support Vector Machine Modeling for Sign Language Recognition with Kinect

Discriminative Exemplar Coding for Sign Language Recognition with Kinect

Sign Language Recognition Using Microsoft Kinect

Sign Language Recognition with Multi-modal Features

Sign Language Recognition from Digital Videos Using Deep Learning Methods

Chinese sign language recognition with adaptive HMM

Sign language recognition using a combination of new vision based features

Sign Language Recognition with Long Short-Term Memory

Sign Language Recognition Based on Trajectory Modeling with HMMs

Recognizing American Sign Language Manual Signs from RGB-D Videos

Video-Based Sign Language Recognition via ResNet and LSTM Network

A new system for Chinese sign language recognition

Sign language recognition using real-sense

Multi-View Spatial-Temporal Network for Continuous Sign Language Recognition

MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition

Natural Language-Assisted Sign Language Recognition

SignCol: Open-Source Software for Collecting Sign Language Gestures

A Classification Model Utilizing Facial Landmark Tracking to Determine Sentence Types for American Sign Language Recognition

Sign Language Recognition Based on Adaptive HMMs with Data Augmentation

Indian Sign Language Gesture Recognition using Image Processing and Deep Learning

Video-Based Sign Language Recognition Without Temporal Segmentation