A neural network model for encoding and perception of vowel sounds

Osamu Hoshino, Masayuki Miyamoto, MeiHong Zheng, Kazuharu Kuroiwa
DOI: https://doi.org/10.1016/S0925-2312(02)00397-1
IF: 6
2002-01-01
Neurocomputing
Abstract: By simulating a hierarchical neural network model, we investigated the neuronal bases for encoding and perception of vowel sounds. The lower network detects the spectral peaks of a vowel sound, called formant frequencies. The higher network detects the combinatory information of the first (F1) and second (F2) formant frequencies. We trained the model with five Japanese vowels spoken by different people and modified synaptic connections according to the Hebbian rule. The present model can recognize not only learned vowel sounds but also vowel sounds that the model hears for the first time. We suggest that an ‘unknown’ vowel sound can be perceived if it activates a portion of a cell assembly whose population activation encodes information about the category of the unknown vowel sound.
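The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' model: the channel width `BIN_HZ`, the `spread` of tuning, the learning `rate`, the function names, and all formant values in `TRAIN` are hypothetical placeholders chosen for illustration, not data from the paper. A lower stage maps each formant to a frequency channel, a higher stage activates cells tuned to (F1, F2) channel pairs, and a Hebbian update strengthens the link between co-active combination cells and a vowel category.

```python
# Illustrative sketch only. BIN_HZ, spread, rate, and the formant values in
# TRAIN are assumptions for demonstration, not values from the paper.

BIN_HZ = 200  # assumed width of one frequency channel, in Hz


def formant_bin(freq_hz):
    """Lower stage (simplified): map a formant frequency to a channel index."""
    return int(freq_hz // BIN_HZ)


def combination_cells(f1_hz, f2_hz, spread=1):
    """Higher stage (simplified): cells tuned to (F1, F2) channel pairs.

    Neighbouring channels are co-activated to mimic broad tuning, so nearby
    sounds activate overlapping sets of cells.
    """
    b1, b2 = formant_bin(f1_hz), formant_bin(f2_hz)
    return {(b1 + d1, b2 + d2)
            for d1 in range(-spread, spread + 1)
            for d2 in range(-spread, spread + 1)}


def hebbian_train(samples, rate=0.1):
    """Strengthen weights between co-active combination cells and categories."""
    weights = {}
    for vowel, f1, f2 in samples:
        for cell in combination_cells(f1, f2):
            key = (cell, vowel)
            # Hebbian update: pre- and postsynaptic activity are both 1 here.
            weights[key] = weights.get(key, 0.0) + rate
    return weights


def recognize(weights, f1_hz, f2_hz):
    """Return the category with the largest summed input, or None if no
    learned cell is activated."""
    active = combination_cells(f1_hz, f2_hz)
    scores = {}
    for (cell, vowel), w in weights.items():
        if cell in active:
            scores[vowel] = scores.get(vowel, 0.0) + w
    return max(scores, key=scores.get) if scores else None


# Hypothetical (vowel, F1, F2) examples standing in for two speakers.
TRAIN = [
    ("a", 800, 1300), ("a", 850, 1400),
    ("i", 300, 2300), ("i", 280, 2500),
    ("u", 350, 1300), ("u", 330, 1200),
    ("e", 500, 1900), ("e", 520, 2000),
    ("o", 500, 900), ("o", 480, 800),
]

weights = hebbian_train(TRAIN)
unseen = recognize(weights, 780, 1350)  # a sound not in TRAIN
```

In this toy setting, the unseen sound is assigned to a category because it activates part of the set of cells strengthened during training, which loosely mirrors the partial cell-assembly activation the abstract describes. A sound whose formants fall far from every learned region activates no learned cells and returns `None`.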