Discriminative Vector Space Model Based Language Recognition

刘巍巍,张卫强,刘加
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2013.06.013
2013-01-01
Abstract:Conventional language recognition tasks are limited by the need for large training datasets, in which most of the discriminative information is overlapped. Moreover, the non-language variabilities (such as channel and speaker differences) also affect the performance of language recognition systems. This paper describes a method using discriminative vector space models (D-VSMs) where the overlapping training information is automatically eliminated. Thus, every VSM is trained for one special situation, and the whole system has good performance. D-VSMs only use 30% of the training data of the baseline system and cost only 10% computation of the baseline with the equal error rate (EER) for the system in the National Institute of Standards and Technology (NIST) Language Recognition Evaluation (LRE) 2009 Database reduced 12.75%, 15.89% and 7.33% in 30 s, 10 s and 3 s tests.
What problem does this paper attempt to address?