Construction of a compact dynamic decoder network for large vocabulary continuous speech recognition

Jia Liu, Xie Chen, Yuxiang Shan, Yongzhe Shi
2012-01-01
Abstract: Large vocabulary continuous speech recognition (LVCSR) systems involve various knowledge sources, such as an acoustic model, a language model, and a pronunciation dictionary. The decoder network, as the basis of the decoder, has a critical influence on decoder performance. By effectively integrating these knowledge sources, a compact decoder network reduces the search space and avoids repeated computations, which speeds up recognition. This paper describes a compact dynamic decoder network that uses hidden Markov model states as the network nodes, together with an efficient word-end pushing algorithm for speech recognition. The construction combines the traditional forward and backward merging algorithms, reducing the number of nodes and edges by a factor of 4 compared with a linear lexical decoder network and yielding half as many nodes as the well-known open-source tool HDecode. The number of nodes needed to calculate the look-ahead score is also cut in half. Because the acoustic model is based on triphones, decoder networks can easily be built for other languages.
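To give a feel for why merging shrinks a decoder network, the following minimal sketch compares node counts for a linear lexicon (one independent chain per word) and a prefix tree in which words sharing initial units share nodes. It is an illustration only, not the paper's construction: the toy lexicon, the function names, and the use of phones rather than HMM states as nodes are all assumptions, and only forward (prefix) merging is shown.

```python
from typing import Dict, List

# Hypothetical toy lexicon (word -> phone sequence); not taken from the paper.
LEXICON: Dict[str, List[str]] = {
    "cat": ["k", "ae", "t"],
    "cab": ["k", "ae", "b"],
    "can": ["k", "ae", "n"],
    "bat": ["b", "ae", "t"],
}


def linear_node_count(lexicon: Dict[str, List[str]]) -> int:
    """Linear lexicon: every word gets its own chain, one node per phone."""
    return sum(len(phones) for phones in lexicon.values())


def prefix_tree_node_count(lexicon: Dict[str, List[str]]) -> int:
    """Forward merging: words with a common prefix share the prefix nodes."""
    root: dict = {}
    count = 0
    for phones in lexicon.values():
        node = root
        for phone in phones:
            if phone not in node:
                node[phone] = {}  # a new node is created only when the prefix diverges
                count += 1
            node = node[phone]
    return count


if __name__ == "__main__":
    print("linear nodes:     ", linear_node_count(LEXICON))
    print("prefix-tree nodes:", prefix_tree_node_count(LEXICON))
```

Backward merging would additionally share common suffixes (here, the final "ae t" of "cat" and "bat" under suitable conditions), which is where further savings of the kind the abstract reports would come from; that step is omitted from this sketch.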