Abstract: Researchers present a silent speech interface optimized for human‐robot collaboration. Using an unobtrusive, conformal, transparent, and ion‐conductive electromyography electrode array, the system captures speech‐relevant muscle activity and enables silent speech control of a robotic manipulator. Integrated with an optical hand‐tracking system, it facilitates natural robot control and collaborative tasks in noisy environments, enhancing human‐robot interaction.

Silent speech interfaces offer an alternative, efficient communication modality for individuals with voice disorders and for settings in which vocalized speech is compromised by noise. Despite recent progress in developing silent speech interfaces, these systems face several challenges that prevent their wide acceptance, such as bulkiness, obtrusiveness, and immobility. Herein, the material optimization, structural design, deep learning algorithm, and system integration of mechanically and visually unobtrusive silent speech interfaces are presented; the interfaces realize both speaker identification and speech content identification. Conformal, transparent, and self‐adhesive electromyography electrode arrays are designed to capture speech‐relevant muscle activity. Temporal convolutional networks are employed to recognize speakers and to convert the sensed signals into spoken content. The resulting silent speech interfaces achieve 97.5% speaker classification accuracy and 91.5% keyword classification accuracy using four electrodes. The speech interface is further integrated with an optical hand‐tracking system and a robotic manipulator for human‐robot collaboration in both assembly and disassembly processes. The integrated system controls the robotic manipulator through silent speech and facilitates the hand‐over process by detecting hand motion trajectories. The developed framework enables natural robot control in noisy environments and lays the groundwork for collaborative human‐robot tasks involving multiple human operators.
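
The abstract names temporal convolutional networks but gives no architectural details. As a rough illustration only, the sketch below shows the operation such networks are typically built from, a causal dilated 1D convolution over a multichannel signal. The function names, kernel size, and dilation schedule are hypothetical choices for this example and are not taken from the paper; only the four-channel input mirrors the four electrodes mentioned above.

```python
from typing import List

Signal = List[List[float]]  # [channels][time]


def causal_dilated_conv1d(x: Signal, w: List[List[List[float]]], dilation: int) -> Signal:
    """Causal dilated convolution (hypothetical helper, not the paper's code).

    x: input laid out as [in_channels][time]
    w: weights laid out as [out_channels][in_channels][kernel]
    Tap j of the kernel reads the sample j * dilation steps in the past;
    samples before t = 0 are treated as zero (left padding).
    """
    n_in, n_t = len(x), len(x[0])
    k = len(w[0][0])
    out: Signal = []
    for w_o in w:
        row = []
        for t in range(n_t):
            acc = 0.0
            for c in range(n_in):
                for j in range(k):
                    src = t - j * dilation
                    if src >= 0:  # never look at future samples
                        acc += w_o[c][j] * x[c][src]
            row.append(acc)
        out.append(row)
    return out


def receptive_field(kernel_size: int, dilations: List[int]) -> int:
    """Past samples seen by a stack with one causal conv per dilation."""
    return 1 + (kernel_size - 1) * sum(dilations)


# Usage: four input channels (assumed to stand for four electrodes), 8 samples each.
emg = [[float(c + t) for t in range(8)] for c in range(4)]
weights = [[[0.5, 0.5] for _ in range(4)]]  # 1 output channel, kernel size 2
features = causal_dilated_conv1d(emg, weights, dilation=2)
```

Stacking such layers with dilations that double at each level (for example 1, 2, 4) grows the receptive field quickly with depth, which is one common reason this family of models is applied to time-series signals.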

Mixed-modality speech recognition and interaction using a wearable artificial throat

Wearable intelligent throat enables natural speech in stroke patients with dysarthria

An Intelligent Artificial Throat with Sound-Sensing Ability Based on Laser Induced Graphene

Electromyogram-strain synergetic intelligent artificial throat

A Wearable Skin-Like Ultra-Sensitive Artificial Graphene Throat

Review of Intelligent, Flexible Artificial Throats with Sound Emitting, Detecting, and Recognizing Ability

Deep‐Learning‐Enabled MXene‐Based Artificial Throat: Toward Sound Detection and Speech Recognition

Speech Recognition Using Intelligent Piezoresistive Sensor Based on Polystyrene Sphere Microstructures

A Wearable Vision-To-Audio Sensory Substitution Device for Blind Assistance and the Correlated Neural Substrates

Bioinspired dual-channel speech recognition using graphene-based electromyographic and mechanical sensors

Decoding Silent Speech Commands from Articulatory Movements Through Soft Magnetic Skin and Machine Learning

Speaking without vocal folds using a machine-learning-assisted wearable sensing-actuation system

A Wearable Swallowing Recognition System Based on Motion and Dual Photoplethysmography Sensing of Laryngeal Movements

A fully integrated, standalone stretchable device platform with in-sensor adaptive machine learning for rehabilitation

All-weather, natural silent speech recognition via machine-learning-assisted tattoo-like electronics

Ultrasensitive Textile Strain Sensors Redefine Wearable Silent Speech Interfaces with High Machine Learning Efficiency

Decoding Throat-language Using Flexibility Sensors with Machine Learning

Graphene‐based Dual‐function Acoustic Transducers for Machine Learning‐assisted Human–robot Interfaces

Ultrathin Eardrum‐Inspired Self‐Powered Acoustic Sensor for Vocal Synchronization Recognition with the Assistance of Machine Learning

An epidermal sEMG tattoo-like patch as a new human-machine interface for patients with loss of voice

Decoding Silent Speech Cues From Muscular Biopotential Signals for Efficient Human‐Robot Collaborations