Abstract:Objectives: Watching a talker’s mouth is beneficial for speech reception (SR) in many communication settings, especially in noise and when hearing is impaired. Measures for audiovisual (AV) SR can be valuable in the framework of diagnosing or treating hearing disorders. This study addresses the lack of standardized methods in many languages for assessing lipreading, AV gain, and integration. A new method is validated that supplements a German speech audiometric test with visualizations of the synthetic articulation of an avatar that was used, for it is feasible to lip-sync auditory speech in a highly standardized way. Three hypotheses were formed according to the literature on AV SR that used live or filmed talkers. It was tested whether respective effects could be reproduced with synthetic articulation: (1) cochlear implant (CI) users have a higher visual-only SR than normal-hearing (NH) individuals, and younger individuals obtain higher lipreading scores than older persons. (2) Both CI and NH gain from presenting AV over unimodal (auditory or visual) sentences in noise. (3) Both CI and NH listeners efficiently integrate complementary auditory and visual speech features. Design: In a controlled, cross-sectional study with 14 experienced CI users (mean age 47.4) and 14 NH individuals (mean age 46.3, similar broad age distribution), lipreading, AV gain, and integration of a German matrix sentence test were assessed. Visual speech stimuli were synthesized by the articulation of the Talking Head system “MASSY” (Modular Audiovisual Speech Synthesizer), which displayed standardized articulation with respect to the visibility of German phones. Results: In line with the hypotheses and previous literature, CI users had a higher mean visual-only SR than NH individuals (CI, 38%; NH, 12%; p < 0.001). Age was correlated with lipreading such that within each group, younger individuals obtained higher visual-only scores than older persons (rCI = −0.54; p = 0.046; rNH = −0.78; p < 0.001). Both CI and NH benefitted by AV over unimodal speech as indexed by calculations of the measures visual enhancement and auditory enhancement (each p < 0.001). Both groups efficiently integrated complementary auditory and visual speech features as indexed by calculations of the measure integration enhancement (each p < 0.005). Conclusions: Given the good agreement between results from literature and the outcome of supplementing an existing validated auditory test with synthetic visual cues, the introduced method can be considered an interesting candidate for clinical and scientific applications to assess measures important for AV SR in a standardized manner. This could be beneficial for optimizing the diagnosis and treatment of individual listening and communication disorders, such as cochlear implantation.

Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech

Speech Audiometry at Home: Automated Listening Tests via Smart Speakers With Normal-Hearing and Hearing-Impaired Listeners

Visualization of Speech Perception Analysis via Phoneme Alignment: A Pilot Study

Automatically measuring speech fluency in people with aphasia: first achievements using read-speech data

Automated Measurement of Speech Recognition, Reaction Time, and Speech Rate and Their Relation to Self-Reported Listening Effort for Normal-Hearing and Hearing-Impaired Listeners Using various Maskers

Validating a Method to Assess Lipreading, Audiovisual Gain, and Integration During Speech Reception With Cochlear-Implanted and Normal-Hearing Subjects Using a Talking Head

Latent Phrase Matching for Dysarthric Speech

The development of Cantonese Lexical Neighborhood Test: a pilot study

New sentence recognition materials developed using a basic non-native English lexicon

A Large-Scale Study of the Relationship Between Degree and Type of Hearing Loss and Recognition of Speech in Quiet and Noise

Individual Aided Speech-Recognition Performance and Predictions of Benefit for Listeners With Impaired Hearing Employing FADE

An Anechoic, High-Fidelity, Multidirectional Speech Corpus

Feasibility of an Adaptive Version of the Everyday Conversational Sentences in Noise Test

A Strategic Approach for Robust Dysarthric Speech Recognition

Who is Right? A Word-Identification-in-Noise Test for Young Children Using Minimal Pair Distracters

The Thomas More Lists: A Phonemically Balanced Dutch Monosyllabic Speech Audiometry Test

Global access to speech hearing tests

Nonwords Pronunciation Classification in Language Development Tests for Preschool Children

Development of Singapore English speech audiometry test materials

Reference Data for a Quick Speech-in-Noise Hearing Test in the French Language

Validating a novel paradigm for simultaneously assessing mismatch response and frequency-following response to speech sounds