Hierarchical Perception Of Monosyllabic Sounds

K. Waki, T. Ono, O. Hoshino, M. Zheng, K. Kuroiwa
2001-01-01
Abstract: Building on the hierarchical neural network architecture proposed by Waki et al. [1], we investigate the neuronal mechanisms underlying the perception of monosyllabic vocal sounds. A first-stage neural network decomposes monosyllabic sounds into time-varying spectral elements. Three second-stage neural networks integrate these spectral elements into point attractors corresponding to the noise-burst, frequency-modulated (FM), and constant-frequency components of monosyllables. A third-stage neural network integrates these three components into single point attractors corresponding to the monosyllables. We demonstrate that the perception of a monosyllabic sound is the emergence of a single point attractor at the third stage corresponding to the monosyllable, mediated by the three second-stage point attractors corresponding to its three vocal components. We suggest that the FM component is necessary for encoding monosyllables into relevant dynamical point attractors but not for monosyllabic sound perception.
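The second- and third-stage integration described above can be illustrated with a toy sketch. It is not the authors' model: it assumes classical Hopfield-style networks with Hebbian weights as a stand-in for the point-attractor networks, omits the first-stage spectral decomposition entirely, and uses made-up 8-unit codes (`A`, `B`) for two hypothetical monosyllables. All function and variable names are illustrative.

```python
# Toy sketch only: Hopfield-style networks stand in for the paper's
# second- and third-stage networks; all patterns and sizes are made up.

def train(patterns):
    """Hebbian weights with zero diagonal for a list of +/-1 patterns."""
    n = len(patterns[0])
    return [[0.0 if i == j else sum(p[i] * p[j] for p in patterns) / n
             for j in range(n)] for i in range(n)]

def settle(weights, state, max_sweeps=20):
    """Asynchronous updates until the state stops changing (a point attractor)."""
    state = list(state)
    for _ in range(max_sweeps):
        changed = False
        for i, row in enumerate(weights):
            field = sum(w * s for w, s in zip(row, state))
            new = 1 if field > 0 else -1 if field < 0 else state[i]
            if new != state[i]:
                state[i], changed = new, True
        if not changed:
            break
    return state

# Hypothetical component codes for two monosyllables (orthogonal by construction).
A = [1, 1, 1, 1, -1, -1, -1, -1]
B = [1, -1, 1, -1, 1, -1, 1, -1]
COMPONENTS = ("noise_burst", "fm", "cf")

# Second stage: one network per vocal component.
second_stage = {name: train([A, B]) for name in COMPONENTS}
# Third stage: stores the concatenation of the three component attractors.
third_stage = train([A * 3, B * 3])

def perceive(inputs):
    """Map noisy component inputs to a monosyllable index (0 or 1), or None."""
    settled = []
    for name in COMPONENTS:
        settled += settle(second_stage[name], inputs[name])
    final = settle(third_stage, settled)
    return 0 if final == A * 3 else 1 if final == B * 3 else None

def flip(pattern, index):
    """Return a copy of pattern with one element sign-flipped (simulated noise)."""
    noisy = list(pattern)
    noisy[index] = -noisy[index]
    return noisy

noisy_input = {"noise_burst": flip(A, 0), "fm": flip(A, 3), "cf": flip(A, 6)}
result = perceive(noisy_input)
```

In this sketch, each noisy component input first settles into its own second-stage attractor, and the concatenated result then settles into a single third-stage attractor that identifies the monosyllable. The asynchronous update rule is chosen because it is guaranteed to reach a fixed point for symmetric weights with a zero diagonal.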