A Novel I-Vector Framework Using Multiple Features and PCA for Speaker Recognition in Short Speech Condition

Chi Zhang,Xiaoqiang Li,Wei Li,Peizhong Lu,Wenqiang Zhang
DOI: https://doi.org/10.1109/icalip.2016.7846558
2016-01-01
Abstract:Speaker recognition in short speech condition is a difficult topic because the length of training and test speech is very short. One of the main disadvantage of the existing methods for speaker recognition is that they need very sufficient data and it's usually impossible in reality applications. In our experiments, the conventional methods with single feature don't make good performance in short speech. We propose a novel i-vector framework using multiple features and Principal Component Analysis (PCA) in short speech condition to overcome this difficulty, as multiple features combination can represent more aspects of a speaker. PCA is used to map the multiple features to an uncorrelated and orthogonal basis set to meet the requirements of Gaussian Mixture Model (GMM) with diagonal covariance matrices and i-vector. Improvement from the proposed approach compared to a state-of-the-art system are of roughly 50% relative at equal error rate when evaluated on the telephone conditions from the 2010 NIST speaker recognition evaluation (SRE).
What problem does this paper attempt to address?