Improved Posterior Probability Estimation Methods for the Freely-Spoken Speech Evaluation

Sukui XU,Lirong DAI,Si WEI,Qingfeng LIU,Qianyong GAO
2017-01-01
Abstract:Two methods under the deep neural network acoustic modeling framework are proposed to improve the estimation of posterior probability for evaluation of pronunciation of freely-spoken speech:1) the posterior probability is re-estimated with more accurate recognition results by employing RNN language model to re-score the N-best candidates produced from the first decoding process;2) the influence of dialect to posterior probability is taken into account by involving likelihood scores produced by dialect clustered nodes added to deep neural network acoustic model which is re-trained as a multi-lingual style.Experimental results show that these methods increase the correlation (between posterior probabilities and human scores) for 3.5 % and 1.0 % respectively,and the combination of these two methods achieves 4.9 % increase.In a real evaluation task,a 2.2 % absolute improvement is observed in correlation between machine scores and human scores.
What problem does this paper attempt to address?