Exploiting Glottal Information in Speaker Recognition Using Parallel GMMs

Pu Yang,Yingchun Yang,Zhaohui Wu
DOI: https://doi.org/10.1007/11527923_84
2005-01-01
Abstract:The information of the vocal tract and the glottis are two kinds of sources which can characterize speakers. Though the former one has archived quite good performance in automatic speaker recognition (ASR) tasks, the glottal information behaves poorly when used individually. This work explores how to combining vocal tract and glottal information in an efficient and effective way. Taking into account the short-term correlation between them, our improved joint probability function model of the corresponding features is first proposed. Then we present a novel integrating system which uses parallel Gaussian Mixture Models (GMM) grounded on this function. Together with the traditional GMM, it also forms a hybrid model. Both methods were applied to YOHO and SRMC corpus, and experimental works show promising results.
What problem does this paper attempt to address?