Deep Neural Network based Uyghur Large Vocabulary Continuous Speech Recognition

Tuerxun Maimaitiaili, Lirong Dai
DOI: https://doi.org/10.16337/j.1004-9037.2015.02.015
2015-01-01
Abstract: Two methods that employ deep neural networks are proposed for Uyghur large vocabulary continuous speech recognition. In the first, a hybrid architecture combines a deep neural network (DNN) with a hidden Markov model (HMM), with the DNN replacing the Gaussian mixture model (GMM) of the GMM-HMM to compute the state emission probabilities. In the second, a DNN serves as a front-end acoustic feature extractor that produces bottleneck (BN) features, providing more effective acoustic features for the traditional GMM-HMM modeling framework (BN-GMM-HMM). Experimental results show that DNN-HMM and BN-GMM-HMM reduce the word error rate (WER) by 8.84% and 5.86%, respectively, compared with the GMM-HMM baseline system, demonstrating that both methods achieve significant performance improvements.
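To make the hybrid idea concrete, the following is a minimal sketch of how DNN outputs are commonly turned into HMM emission scores. It assumes the usual hybrid convention, which the abstract does not spell out: the DNN outputs state posteriors p(s|x_t), and these are divided by state priors p(s) to obtain scaled likelihoods. The function name `scaled_log_likelihoods`, the `floor` parameter, and the toy numbers are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def scaled_log_likelihoods(log_posteriors, state_priors, floor=1e-10):
    """Convert DNN log-posteriors log p(s|x_t) into scaled log-likelihoods.

    By Bayes' rule, p(x_t|s) = p(s|x_t) * p(x_t) / p(s). The term p(x_t)
    is the same for every state in a frame, so it is dropped here
    (assumed convention for hybrid DNN-HMM decoding).
    """
    # Floor the priors to avoid log(0) for states never seen in training.
    log_priors = np.log(np.maximum(state_priors, floor))
    # Broadcasting subtracts the per-state log-prior from every frame.
    return log_posteriors - log_priors


# Toy example (hypothetical values): 2 frames, 3 HMM states.
posteriors = np.array([[0.7, 0.2, 0.1],
                       [0.1, 0.6, 0.3]])
priors = np.array([0.5, 0.3, 0.2])
scores = scaled_log_likelihoods(np.log(posteriors), priors)
```

In a decoder, `scores` would take the place of the GMM log-likelihoods for each frame and state. In the bottleneck variant, by contrast, the DNN's narrow hidden-layer activations would be fed to a GMM-HMM as features instead.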