Uyghur speech recognition based on deep neural network

Qimike·Batexi,Hao HUANG,Xian-hui WANG
DOI: https://doi.org/10.16208/j.issn1000-7024.2015.08.045
2015-01-01
Abstract:Currently speech recognition is mainly achieved by using hidden Markov models.However,after taking the triphone model into account,the scale of parameters greatly increases,in the circumstances of limited training data,the model parameters are not well trained,thus affecting the speech recognition rate.To improve the speech recognition rate,the method for speech recognition based on deep neural network was proposed.A neural network containing four hidden layers was trained on the kaldi platform,and the model was used to deal with the Uyghur speech recognition.Experimental results show that the error in Uy-ghur speech recognition is reduced by 31.09% and 8.68% respectively using the deep the neural network model compared to that using the basic tone sub-HMM and HMM triphone.And all models of existing optimization algorithm are still valid in this model.
What problem does this paper attempt to address?