Attention-Augmented Electromagnetic Representation of Sign Language for Human-Computer Interaction in Deaf-and-Mute Community

Shengchang Lan, Linting Ye, Kang Zhang
DOI: https://doi.org/10.23919/USNC-URSI51813.2021.9703456
2021-01-01
Abstract: To provide a new interface between computers and deaf-and-mute users, this paper proposes a method of translating sign language into a sequence of time-frequency spectrograms using a 24 GHz 1T-2R (one transmitter, two receivers) Doppler radar sensor. By applying time-frequency analysis to the two pairs of intermediate-frequency (IF) I/Q signals, a complete sign sentence can be captured and segmented according to its electromagnetic wave-based patterns. Rather than a traditional classifier, a convolutional neural network is used to classify the basic signs and make the complete sentence intelligible to the computer. For greater accuracy, an attention module is added to the network. The proposed method can reach an accuracy of 96% in translating short sentences such as “Yes”, “No”, “Thanks”, and “Hello”, which have the highest usage rate in sign language. This work can be considered a supplement to current human-computer interaction, especially for the deaf-and-mute community.
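As a rough illustration of the time-frequency step mentioned in the abstract, the following sketch simulates an ideal I/Q signal for a target moving at constant radial velocity and computes a short-time Fourier magnitude spectrogram from it. It is not the authors' pipeline: the sampling rate, window length, hop size, Hann window, single receive channel, and all function names are assumptions made for illustration, and the speed of light is rounded to 3.0e8 m/s. Only the 24 GHz carrier comes from the abstract.

```python
import cmath
import math

C = 3.0e8           # speed of light in m/s (rounded; assumption for illustration)
F_CARRIER = 24.0e9  # carrier frequency in Hz, as stated in the abstract


def doppler_shift(velocity_mps, carrier_hz=F_CARRIER):
    """Doppler shift f_d = 2 * v * f_c / c for radial velocity v."""
    return 2.0 * velocity_mps * carrier_hz / C


def simulate_iq(velocity_mps, fs, n_samples):
    """Ideal noise-free complex baseband signal I + jQ (hypothetical test input)."""
    fd = doppler_shift(velocity_mps)
    return [cmath.exp(2j * math.pi * fd * n / fs) for n in range(n_samples)]


def stft_magnitude(x, win_len, hop):
    """Magnitude spectrogram from a Hann-windowed DFT; each row is one time frame."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / win_len) for n in range(win_len)]
    frames = []
    for start in range(0, len(x) - win_len + 1, hop):
        seg = [x[start + n] * window[n] for n in range(win_len)]
        spectrum = [
            abs(sum(seg[n] * cmath.exp(-2j * math.pi * k * n / win_len)
                    for n in range(win_len)))
            for k in range(win_len)
        ]
        frames.append(spectrum)
    return frames


def peak_frequency(spectrum, fs):
    """Signed frequency in Hz of the strongest bin; upper-half bins map to negative."""
    n = len(spectrum)
    k = max(range(n), key=lambda i: spectrum[i])
    if k >= n // 2:
        k -= n
    return k * fs / n


# Assumed parameters: 2560 Hz sampling, 128-sample window, 64-sample hop.
FS = 2560.0
iq = simulate_iq(velocity_mps=1.0, fs=FS, n_samples=512)
spectrogram = stft_magnitude(iq, win_len=128, hop=64)
peaks = [peak_frequency(frame, FS) for frame in spectrogram]
```

Because the I/Q signal is complex, the sign of the peak frequency separates motion toward the sensor from motion away from it, which is one reason Doppler spectrograms can carry gesture information. A real recording would contain many scatterers, noise, and two receive channels, so each frame would show a spread of frequencies rather than a single peak.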