Human auditory cortex activity shows additive effects of spectral and spatial cues during speech segregation.

Yi Du,Yu He,Bernhard Ross,Tim Bardouille,Xihong Wu,Liang Li,Claude Alain
DOI: https://doi.org/10.1093/cercor/bhq136
IF: 4.861
2011-01-01
Cerebral Cortex
Abstract:In noisy social gatherings, listeners perceptually integrate sounds originating from one person's voice (e.g., fundamental frequency (f(0)) and harmonics) at a particular location and segregate these from concurrent sounds of other talkers. Though increasing the spectral or the spatial distance between talkers promotes speech segregation, synergetic effects of spatial and spectral distances are less well understood. We studied how spectral and/or spatial distances between 2 simultaneously presented steady-state vowels contribute to perception and activation in auditory cortex using magnetoencephalography. Participants were more accurate in identifying both vowels when they differed in f(0) and location than when they differed in a single cue only or when they shared the same f(0) and location. The combined effect of f(0) and location differences closely matched the sum of single effects. The improvement in concurrent vowel identification coincided with an object-related negativity that peaked at about 140 ms after vowel onset. The combined effect of f(0) and location closely matched the sum of the single effects even though vowels with different f(0), location, or both generated different time courses of neuromagnetic activity. We propose that during auditory scene analysis, acoustic differences among the various sources are combined linearly to increase the perceptual distance between the co-occurring sound objects.
What problem does this paper attempt to address?