Cross-Speaker Silent-Speech Command Word Recognition Using Electro-Optical Stomatography

Simon Stone, Peter Birkholz
DOI: https://doi.org/10.1109/icassp40776.2020.9053447
2020-05-01
Abstract: Speech recognition based on articulatory movements instead of the acoustic signal is of growing interest in the community. In this work, we present the results of a study that uses a novel measurement technology called Electro-Optical Stomatography to capture speech movements and uses the acquired data to recognize a number of command words. The performance of the recognition system was evaluated using two vocabularies (one with 30 words and one with 10 words) and four speakers. The speaker-dependent results were on par with the state of the art, with average word accuracies of 97% to 99.5%, while the speaker-independent results exceeded it, with average word accuracies of approximately 56% to 62%.