The Study of Vocal Tract Length Normalization Based on Single Mixture in Noisy Environment

ZHANG Wen-ming,ZHANG Xiang-dong,ZHANG Xing-gan,HOU Zhen
DOI: https://doi.org/10.3969/j.issn.1002-2279.2006.05.035
2006-01-01
Microprocessors
Abstract:Vocal tract length normalization is one of speaker adaptation in speech recognition.In this paper,we focus on the study of it and do a series of experiments.In its realization,we firstly adopt the means on scale factor which is based on single mixture in noisy environment and reach the better result.The experiments are based on AURORA speech database.We recognize the models using the test set in noisy car environment which is included in AURORA speech database.The results show that in various noise the recognized results of the VTLN are better than those of no VTLN.Iterative training can improve the performance of single turn VTLN and the optimal result is in third turns.In noisy environment,the average sentence correction based on the scale factor of single mixture is improved more 1.68 percent than that of the other mixtures. The gender independent performance of no VTLN is close to the gender dependent performance of VTLN.If the training data is sufficent,the gender independent performance of VTLN is better.
What problem does this paper attempt to address?