Low-dimensional Representation of Gaussian Mixture Model Supervector for Language Recognition

Jinchao Yang,Xiang Zhang,Hongbin Suo,Li Lu,Jianping Zhang,Yonghong Yan
DOI: https://doi.org/10.1186/1687-6180-2012-47
2012-01-01
EURASIP Journal on Advances in Signal Processing
Abstract:In this article, we propose a new feature which could be used for the framework of SVM-based language recognition, by introducing the idea of total variability used in speaker recognition to language recognition. We consider the new feature as low-dimensional representation of Gaussian mixture model supervector. Thus we propose multiple total variability (MTV) language recognition system based on total variability (TV) language recognition system. Our experiments show that the total factor vector includes the language dependent information; what's more, multiple total factor vector contains more language dependent information.Experimental results on 2007 National Institute of Standards and Technology (NIST) Language Recognition Evaluation (LRE) databases show that MTV outperforms TV in 30 s tasks, and both TV and MTV systems can achieve performance similar to that obtained by state-of-the-art approaches. Best performance of our acoustic language recognition systems can be further improved by combining these two new systems.
What problem does this paper attempt to address?