Perceptual uncertainty explains activation differences between audiovisual congruent speech and McGurk stimuli

Chenjie Dong,Uta Noppeney,Suiping Wang
DOI: https://doi.org/10.1002/hbm.26653
IF: 4.8
2024-03-16
Human Brain Mapping
Abstract:This study investigated the neurocognitive mechanisms underlying the McGurk illusion. We found that McGurk stimuli increased activations in a widespread neural system, and the activation level of this system varied across unisensory and audiovisual congruent stimuli. The activation differences between McGurk and congruent stimuli can be attributed to perceptual uncertainty. Face‐to‐face communication relies on the integration of acoustic speech signals with the corresponding facial articulations. In the McGurk illusion, an auditory /ba/ phoneme presented simultaneously with a facial articulation of a /ga/ (i.e., viseme), is typically fused into an illusory 'da' percept. Despite its widespread use as an index of audiovisual speech integration, critics argue that it arises from perceptual processes that differ categorically from natural speech recognition. Conversely, Bayesian theoretical frameworks suggest that both the illusory McGurk and the veridical audiovisual congruent speech percepts result from probabilistic inference based on noisy sensory signals. According to these models, the inter‐sensory conflict in McGurk stimuli may only increase observers' perceptual uncertainty. This functional magnetic resonance imaging (fMRI) study presented participants (20 male and 24 female) with audiovisual congruent, McGurk (i.e., auditory /ba/ + visual /ga/), and incongruent (i.e., auditory /ga/ + visual /ba/) stimuli along with their unisensory counterparts in a syllable categorization task. Behaviorally, observers' response entropy was greater for McGurk compared to congruent audiovisual stimuli. At the neural level, McGurk stimuli increased activations in a widespread neural system, extending from the inferior frontal sulci (IFS) to the pre‐supplementary motor area (pre‐SMA) and insulae, typically involved in cognitive control processes. Crucially, in line with Bayesian theories these activation increases were fully accounted for by observers' perceptual uncertainty as measured by their response entropy. Our findings suggest that McGurk and congruent speech processing rely on shared neural mechanisms, thereby supporting the McGurk illusion as a valid measure of natural audiovisual speech perception.
radiology, nuclear medicine & medical imaging,neurosciences,neuroimaging
What problem does this paper attempt to address?