Score Regulation Based on GMM Token Ratio Similarity for Speaker Recognition

Yingchun Yang,Licai Deng
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2017.21.006
2017-01-01
Abstract:Summary form only given. A novel approach named GTRSR (GMM Token Ratio Similarity based Score Regulation) for speaker recognition is presented in this paper, which judge the reliability of a test score based on GMM Token Ratio Similarity. GMM Token which is the index of the UBM component giving the highest score is saved for each frame during the training and test phase. Then the amount for each GMM Token is added up to form a vector GTR which stands for the GMM Token ratio of an utterance. In the test phase, we compute the similarity between the GMM Token ratio of test utterance and training utterance for a target speaker, i.e. GTRS. When GTRS is smaller than a threshold, the original likelihood score is regulated by multiplying a penalty factor as the final score of this test utterance. Experiments conducted on MASC@CCNT show our GTRSR can improve the performance of speaker recognition.
What problem does this paper attempt to address?