Key Technology Research for Speech Recognition

Xi Xiaojing,Lin Kunhui,Zhou Changle,Cai Jun
DOI: https://doi.org/10.3321/j.issn:1002-8331.2006.11.021
2006-01-01
Abstract:Because of the application of the Hidden Markov Model(HMM) in acoustic modeling,a significant breakthrough has been made in recognizing continuous speech with a large glossary.However,some unreasonable hypotheses for acoustic modeling and the unclassified training algorithm on which the HMM based form a bottleneck,restricting the further improvement in speech recognition.The Artificial Neural Network(ANN) techniques can be adopted as an alternative modeling paradigm.By means of the weight values of the network connections,neural networks can steadily store the knowledge acquired from the training process.But they possess a weak memory,not being suitable to store the instantaneous response to various input modes.To overcome the flaws of the HMM paradigm,we design a hybrid HMM/ANN model.In this hybrid model,the nonparametric probabilistic model(a BP neural network) is used to substitute the Gauss blender to calculate the observed probability which is necessary for computing the states of the HMM model.Besides,we optimize the structure of the network,and experiments show that the hybrid model has a good performance in speech recognition.
What problem does this paper attempt to address?