Auditory Model Based Speech Feature Extraction and Its Application to Speaker Identification

Bing Xiang, Xihong Wu, Zhimin Liu, Huisheng Chi
DOI: https://doi.org/10.1109/ijcnn.1999.831502
1999-01-01
Abstract: Drawing on the characteristics of the auditory periphery and cochlear nucleus, and attempting to simulate the mechanism of the auditory system as a whole, this paper presents two novel speech features and adopts a neural network framework. The two features are the weighted average localized synchronized rate cepstrum and the weighted firing rate cepstrum. Both are applied to speaker identification. A modular tree and modified linear opinion pools are used as classifiers to simulate the parallel processing mechanism of the upper-level functions of the auditory system. Good recognition accuracy is obtained in both clean and noisy environments.
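The classifier combination mentioned in the abstract can be illustrated with a minimal sketch of a standard linear opinion pool, in which each classifier's class posteriors are averaged with non-negative weights and the speaker with the highest pooled score is selected. The abstract does not describe how the authors' pools are modified, so this sketch shows only the unmodified textbook form; the function names `linear_opinion_pool` and `identify_speaker`, the weights, and the example posteriors are all hypothetical.

```python
def linear_opinion_pool(posteriors, weights):
    """Combine per-classifier class posteriors by a weighted average.

    posteriors: list of lists, one row per classifier, one column per speaker.
    weights: one non-negative weight per classifier (normalized here to sum to 1).
    This is the standard pool, not the paper's modified variant.
    """
    if not posteriors or len(posteriors) != len(weights):
        raise ValueError("need one weight per classifier")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    n_classes = len(posteriors[0])
    pooled = [0.0] * n_classes
    for w, row in zip(weights, posteriors):
        if len(row) != n_classes:
            raise ValueError("all classifiers must score the same speakers")
        for c, p in enumerate(row):
            pooled[c] += (w / total) * p
    return pooled


def identify_speaker(posteriors, weights):
    """Return the index of the speaker with the highest pooled posterior."""
    pooled = linear_opinion_pool(posteriors, weights)
    return max(range(len(pooled)), key=pooled.__getitem__)


# Hypothetical example: two classifiers (e.g. one per feature type), three speakers.
example_posteriors = [
    [0.6, 0.3, 0.1],
    [0.2, 0.7, 0.1],
]
equal_choice = identify_speaker(example_posteriors, [1.0, 1.0])
skewed_choice = identify_speaker(example_posteriors, [3.0, 1.0])
```

With equal weights the second speaker wins, while weighting the first classifier more heavily shifts the decision to the first speaker, which shows how the pool weights control each classifier's influence.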