Electromyogram-Based Lip-Reading via Unobtrusive Dry Electrodes and Machine Learning Methods.

P. Djurić,S. Mallipattu,Shangyouqiao Yu,Yuanqing Song,Zimeng Zhang,Shanshan Yao,Penghao Dong
DOI: https://doi.org/10.1002/smll.202205058
IF: 13.3
2023-01-26
Small
Abstract:Lip-reading provides an effective speech communication interface for people with voice disorders and for intuitive human-machine interactions. Existing systems are generally challenged by bulkiness, obtrusiveness, and poor robustness against environmental interferences. The lack of a truly natural and unobtrusive system for converting lip movements to speech precludes the continuous use and wide-scale deployment of such devices. Here, the design of a hardware-software architecture to capture, analyze, and interpret lip movements associated with either normal or silent speech is presented. The system can recognize different and similar visemes. It is robust in a noisy or dark environment. Self-adhesive, skin-conformable, and semi-transparent dry electrodes are developed to track high-fidelity speech-relevant electromyogram signals without impeding daily activities. The resulting skin-like sensors can form seamless contact with the curvilinear and dynamic surfaces of the skin, which is crucial for a high signal-to-noise ratio and minimal interference. Machine learning algorithms are employed to decode electromyogram signals and convert them to spoken words. Finally, the applications of the developed lip-reading system in augmented reality and medical service are demonstrated, which illustrate the great potential in immersive interaction and healthcare applications.
Computer Science,Medicine,Engineering
What problem does this paper attempt to address?