Language Recognition Based on Unsupervised Transfer Component Analysis

徐嘉明,张卫强,刘加,夏善红
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2013.06.014
2013-01-01
Abstract: Distribution mismatches between training and test datasets can greatly reduce the performance of language recognition systems. The mismatch is typically caused by variability in the channel and other factors. Real-world applications often have many training samples from other source domains but only a limited number of labeled training samples from the target domain. This study uses transfer learning, specifically unsupervised transfer component analysis (UTCA), to find a low-dimensional subspace that minimizes the distribution mismatch between the source and target domain samples while preserving the useful properties of the data. Tests show that UTCA gives relative improvements of 24.7% and 8% over the baseline system at 30 s and 10 s durations, respectively.
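The abstract does not state the exact UTCA objective, so the following is only a minimal sketch of the general idea, assuming the commonly used unsupervised transfer component analysis formulation with a linear kernel: build a kernel matrix over the pooled source and target samples, penalize the distance between the two domain means in the projected space, and keep the directions that retain the most data variance. The function name `tca_embed` and the parameters `dim` (subspace dimension) and `mu` (regularization weight) are illustrative assumptions, not names taken from the paper.

```python
import numpy as np


def tca_embed(Xs, Xt, dim=2, mu=1.0):
    """Project source (Xs) and target (Xt) samples into a shared subspace.

    Sketch of unsupervised TCA with a linear kernel (an assumption; the
    paper's actual kernel and settings are not given in the abstract).
    """
    ns, nt = len(Xs), len(Xt)
    n = ns + nt
    X = np.vstack([Xs, Xt])

    # Linear kernel over the pooled source and target samples.
    K = X @ X.T

    # L encodes the squared distance between the domain means:
    # trace(K L) equals ||mean(source) - mean(target)||^2 in kernel space.
    e = np.concatenate([np.full(ns, 1.0 / ns), np.full(nt, -1.0 / nt)])
    L = np.outer(e, e)

    # Centering matrix, used to measure the variance kept by the projection.
    H = np.eye(n) - np.full((n, n), 1.0 / n)

    # Leading eigenvectors of (K L K + mu I)^{-1} K H K trade off a small
    # domain mismatch against a large preserved variance.
    A = np.linalg.solve(K @ L @ K + mu * np.eye(n), K @ H @ K)
    vals, vecs = np.linalg.eig(A)
    order = np.argsort(-vals.real)[:dim]
    W = vecs[:, order].real

    # Embed all samples, then split back into source and target parts.
    Z = K @ W
    return Z[:ns], Z[ns:]
```

In a language recognition setting, `Xs` and `Xt` would hold one feature vector per utterance from the source and target domains, and a classifier would then be trained on the projected source vectors; that pipeline is likewise an assumption about usage rather than a description of the authors' system.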