Real-Time Vision-Based Chinese Sign Language Recognition with Pose Estimation and Attention Network

Sirui Cheng,Chaorui Huang,Zhaohui Wang,Jiaxing Wang,Zhen Zeng,Fei Wang,Qichuan Ding
DOI: https://doi.org/10.1109/robio54168.2021.9739638
2021-01-01
Abstract:Currently, communication between deaf and hearing people is still facing great problems both at the research level and at the application level. Also, practical automatic sign language interpretation methods have become a relatively large demand. In this paper, we will recognize video sign language based on computer vision to capture skeletal key point information using multi-head attention mechanism and long and short-term memory mechanism. We construct a database of Chinese video sign language consisting of 30 words, with a total data volume of more than 1200. Meanwhile, the experiments of our proposed framework on this dataset achieved 85% accuracy rate. The experimental results show that our proposed method has the characteristics of high accuracy and light weight in the problem of Chinese sign language recognition.
What problem does this paper attempt to address?