Transfer Learning for Acoustic Modeling of Noise Robust Speech Recognition

YI Jiangyan, TAO Jianhua, LIU Bin, WEN Zhengqi
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2018.21.001
2018-01-01
Abstract: Speech recognition in noisy environments was improved by using transfer learning to train acoustic models. An acoustic model trained on clean data (the teacher model) guides the training of an acoustic model trained on noisy data (the student model). The training process forces the posterior probability distribution of the student model to be close to that of the teacher model by minimizing the Kullback-Leibler (KL) divergence between the two distributions. Tests on the CHiME-2 dataset show that this method gives a 7.29% absolute average word error rate (WER) improvement over the baseline model and a 3.92% absolute average WER improvement over the best CHiME-2 system.
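The teacher-student objective described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the divergence is taken as KL(teacher || student), that the two models score time-aligned clean and noisy versions of the same frames, and that both output logits over the same set of classes; the abstract does not state any of these details. The function names and the toy values are hypothetical.

```python
import math

def softmax(logits):
    """Convert a list of raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(teacher_probs, student_probs, eps=1e-12):
    """KL(teacher || student) for a single frame (assumed direction)."""
    return sum(
        p * (math.log(p + eps) - math.log(q + eps))
        for p, q in zip(teacher_probs, student_probs)
        if p > 0.0
    )

def teacher_student_loss(teacher_logits, student_logits):
    """Mean frame-level KL divergence over one utterance."""
    losses = [
        kl_divergence(softmax(t), softmax(s))
        for t, s in zip(teacher_logits, student_logits)
    ]
    return sum(losses) / len(losses)

# Hypothetical toy data: 2 frames, 3 output classes.
teacher_logits = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]  # teacher scores clean speech
student_logits = [[1.0, 0.8, -0.5], [0.0, 1.0, 0.9]]  # student scores the noisy version
loss = teacher_student_loss(teacher_logits, student_logits)
print(f"teacher-student KL loss: {loss:.4f}")
```

In this sketch the teacher's posteriors are treated as fixed targets, so minimizing the loss would update only the student. The loss is zero when the two posterior distributions match on every frame and grows as they diverge.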