Arabic sign language letters recognition using Vision Transformer

Aya F. Alnabih,Ashraf Y. Maghari,Alnabih, Aya F.
DOI: https://doi.org/10.1007/s11042-024-18681-3
IF: 2.577
2024-03-07
Multimedia Tools and Applications
Abstract:Sign languages, as means of communication, enable individuals to convey messages through hand gestures, body movements, and facial expressions. It is primarily utilized by those with hearing impairments to communicate with others. Currently the recognition of static sign language predominantly relies on the Convolutional Neural Network (CNN) approaches and transfer learning for classifying hand sign images. While researchers have been actively working on this issue, only a few have explored the potential of Vision Transformers (ViT) in addressing the sign language recognition problem, particularly Arabic sings. Where, no record or documentation aimed at identifying Arabic sign language letters using ViT model. Vision Transformers is a new addition to the world of deep neural networks and have shown promising performance with less computational power required compared to existing methods. This paper aims to leverage the capabilities of Vision Transformers to improve the accuracy of Arabic sign language recognition. In that context, a ViT-based model is proposed in which a pre trained ViT model is fine tuned to be adapted for recognizing Arabic sign language letters. To finetune and evaluate our ViT-based model, we utilize the ArSL2018 dataset which consists of 54,049 images of Arabic sign language letters with 32 classes. Accuracy, F1 score, recall and precision are used for evaluation. The proposed model achieved an accuracy of 99.3% on the ArSL2018 dataset, which outperforms some recent CNN based models. Additionally, we conducted evaluations on real case images to assess the practicality and real-world applicability of our model.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?