Abstract:Objectives: Visual and contextual cues facilitate speech recognition in suboptimal listening conditions (e.g., background noise, hearing loss, hearing aid signal processing). Moreover, successful speech recognition in challenging listening conditions is linked to cognitive abilities such as working memory and fluid intelligence. However, it is unclear which cognitive abilities facilitate the use of visual and contextual cues in individuals with normal hearing and hearing aid users. The first aim was to investigate whether individuals with hearing aid users rely on visual and contextual cues to a higher degree than individuals with normal hearing in a speech-in-noise recognition task. The second aim was to investigate whether working memory and fluid intelligence are associated with the use of visual and contextual cues in these groups. Design: Groups of participants with normal hearing and hearing aid users with bilateral, symmetrical mild to severe sensorineural hearing loss were included (n = 169 per group). The Samuelsson and Rönnberg task was administered to measure speech recognition in speech-shaped noise. The task consists of an equal number of sentences administered in the auditory and audiovisual modalities, as well as without and with contextual cues (visually presented word preceding the sentence, e.g.,: "Restaurant"). The signal to noise ratio was individually set to 1 dB below the level obtained for 50% correct speech recognition in the hearing-in-noise test administered in the auditory modality. The Reading Span test was used to measure working memory capacity and the Raven test was used to measure fluid intelligence. The data were analyzed using linear mixed-effects modeling. Results: Both groups exhibited significantly higher speech recognition performance when visual and contextual cues were available. Although the hearing aid users performed significantly worse compared to those with normal hearing in the auditory modality, both groups reached similar performance levels in the audiovisual modality. In addition, a significant positive relationship was found between the Raven test score and speech recognition performance only for the hearing aid users in the audiovisual modality. There was no significant relationship between Reading Span test score and performance. Conclusions: Both participants with normal hearing and hearing aid users benefitted from contextual cues, regardless of cognitive abilities. The hearing aid users relied on visual cues to compensate for the perceptual difficulties, reaching a similar performance level as the participants with normal hearing when visual cues were available, despite worse performance in the auditory modality. It is important to note that the hearing aid users who had higher fluid intelligence were able to capitalize on visual cues more successfully than those with poorer fluid intelligence, resulting in better speech-in-noise recognition performance.

Multisensory benefits for speech recognition in noisy environments

Speech-derived haptic stimulation enhances speech recognition in a multi-talker background

EXPRESS: Prior multisensory learning can facilitate auditory-only voice-identity and speech recognition in noise

Cross-modal Mask Fusion and Modality-Balanced Audio-Visual Speech Recognition

Correlation Between Audio–visual Enhancement of Speech in Different Noise Environments and SNR: A Combined Behavioral and Electrophysiological Study

Vision Perceptually Restores Auditory Spectral Dynamics in Speech

Neuronal basis of audio-tactile speech perception

Immediate improvement of speech-in-noise perception through multisensory stimulation via an auditory to tactile sensory substitution

Spatial alignment between faces and voices improves selective attention to audio-visual speech

Prediction and constraint in audiovisual speech perception

Visual Facial Enhancements Can Significantly Improve Speech Perception in the Presence of Noise

Enhancing speech perception in noise through articulation

Implicit multisensory associations influence voice recognition

Reassessing the Benefits of Audiovisual Integration to Speech Perception and Intelligibility

Relationships Between Hearing Status, Cognitive Abilities, and Reliance on Visual and Contextual Cues

Visual Cues Contribute Differentially to Audiovisual Perception of Consonants and Vowels in Improving Recognition and Reducing Cognitive Demands in Listeners With Hearing Impairment Using Hearing Aids

Improved tactile speech perception using audio-to-tactile sensory substitution with formant frequency focusing

Effect of enhancement of spectral changes on speech intelligibility and clarity preferences for the hearing impaired.

Interference of mid-level sound statistics underlie human speech recognition sensitivity in natural noise

Overlapping frequency coverage and simulated spatial cue effects on bimodal (electrical and acoustical) sentence recognition in noise

The effects of temporal cues, point-light displays, and faces on speech identification and listening effort