Combined GMM-UBM and SVM Speaker Identification System

Fang Zheng
2008-01-01
Abstract:The Gaussian mixture model-universal background model(GMM-UBM) speaker identification system uses the features of each frame to model and identify the characteristics of the target speaker but has poor robustness to channel effects.The support vector machine(SVM) speaker identification system uses the mean vector of each Gaussian mixture of the frame vectors to model and identify the speaker with much more robust channel effects but while ignoring the characteristics of the target speaker.Tests of a combined strategy integrate the advantages of these two systems on the National Institute of Standards and Technology(NIST) evaluation corpus show that a linear combination of the GMM-UBM system which had an equal error rate(EER) of 9.30% and an SVM-EAP system with an EER of 8.06% gave a final EER of 7.34%.
What problem does this paper attempt to address?