Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers.

Wenping Hu, Yao Qian, Frank K. Soong, Yong Wang
DOI: https://doi.org/10.1016/j.specom.2014.12.008
IF: 2.723
2015-01-01
Speech Communication
Abstract:
• The discriminative power of the acoustic model used for Goodness of Pronunciation (GOP) calculation is improved by DNN training.
• F0 is added to the DNN model to detect misused lexical stress or tone.
• The GOP measure is improved to robustly evaluate the pronunciation of non-native speech.
• A transfer learning based classifier is proposed for mispronunciation detection.