Intelligent real-time Arabic sign language classification using attention-based inception and BiLSTM

Wadood Abdul, Mansour Alsulaiman, Syed Umar Amin, Mohammed Faisal, Ghulam Muhammad, Fahad R. Albogamy, Mohamed A. Bencherif, Hamid Ghaleb
DOI: https://doi.org/10.1016/j.compeleceng.2021.107395
2021-10-01
Abstract: Bio-inspired deep learning models have revolutionized sign language classification, achieving extraordinary accuracy and human-like video understanding. Recognizing and classifying sign language videos in real time is challenging because the duration and speed of each sign vary between subjects, the video background is dynamic in most cases, and the classification result must be produced in real time. This study proposes a model that combines a convolutional neural network (CNN) Inception model with an attention mechanism for spatial feature extraction and a bidirectional long short-term memory network (Bi-LSTM) for temporal feature extraction. The proposed model is tested on datasets with highly variable characteristics, such as different clothing, variable lighting, and variable distance from the camera. Real-time classification yields a significant number of early detections while maintaining performance comparable to offline operation. The proposed model has fewer parameters and fewer deep learning layers, and requires significantly less processing time, than state-of-the-art models.
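As a rough illustration of the attention idea named in the abstract, the sketch below applies soft attention pooling to the spatial features of a single frame: each region gets a score, the scores are normalized with a softmax, and the region features are averaged with those weights. This is a minimal, generic sketch and not the authors' implementation; the names `softmax`, `attention_pool`, `region_features`, and `query` are hypothetical, and the abstract does not say which attention variant the paper uses.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(region_features, query):
    """Soft attention over the spatial regions of one frame.

    region_features: list of feature vectors, one per spatial region
                     (assumed here to come from a CNN such as Inception).
    query:           hypothetical learned vector used to score each region.
    Returns the attention-weighted feature vector and the weights.
    """
    # Score each region by its dot product with the query vector.
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in region_features]
    weights = softmax(scores)
    dim = len(region_features[0])
    # Weighted sum of the region features, one dimension at a time.
    pooled = [
        sum(w * feat[d] for w, feat in zip(weights, region_features))
        for d in range(dim)
    ]
    return pooled, weights


# Toy frame with three regions and two feature dimensions.
frame = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
pooled, weights = attention_pool(frame, [5.0, 5.0])
```

In a pipeline like the one the abstract describes, one pooled vector per frame would form the sequence passed to the temporal model (the Bi-LSTM); that wiring is assumed here and not shown.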
engineering, electrical & electronic; computer science, interdisciplinary applications, hardware & architecture