Abstract:Neural network language model (NNLM) has achieved very good results in the field of speech recognition, machine translation, etc. Direct decoding with NNLM is challenging for the overwhelmingly heavy burden in complexity. Most of the previous work focused on rescoring the N-best list and lattice with NNLM in the second pass. In this work, several techniques are explored to directly incorporate the NNLM into the decoder of speech recognition. A novel training algorithm based on variance regularization is proposed to approximate the softmax-normalizing factor as a constant for fast evaluation. Also, the evaluation of NNLM is further speeded up via our advanced storage. Moreover, a simple cache-based strategy is explored to avoid redundant computations during the decoding process. To the authors' knowledge, it is the first time to directly incorporate NNLM into decoding. We evaluate our proposed methods on an English-Switchboard phone-call speech-to-text task. Experimental results show that incorporating the NNLM into the decoder significantly reduces the word error rate (WER) by 1.5% and 1.4% absolutely on the Hub5'00-SWB and RT03S-FSH sets, respectively. Also, the decoding with NNLM is twice as fast as the baseline at the same word error rate.

A Novel Efficient Decoding Algorithm for CDHMM-based Speech Recognizer on Chip.

Multi-Pass Decoding Algorithm Based on a Speech Recognition Chip

A Novel and Efficient Voice Activity Detector Using Shape Features of Speech Wave.

A Novel MPEG Audio Decoder Design and Improved Algorithm

The dynamically-adjustable histogram pruning method for embedded voice dialing

Real-time Speaker Recognition System for PDA

Real-Time Speech Recognition Method for Embedded System

Efficient One-Pass Decoding with Nnlm for Speech Recognition

Application and Implementation of Dynamically-Adjustable Histogram Pruning for PDA Voice Dialing

Construction of a compact dynamic decoder network for large vocabulary continuous speech recognition

Novel Efficient Algorithms in Speech / Speaker Recognition

Implementation of Convolutional Coding and Viterbi Decoding in DRM System

Moderate Vocabulary English Speech Recognition System Embedded on a Chip

A Fast Error-tolerant Algorithm in Decoding Module of Speech Recognition

Acceleration Strategies for Speech Recognition Based on Deep Neural Networks

Design and implementation of real-time telephone speech recognition system using DSP TMS320C31

An Efficient Computation Algorithm In Mandarin Continuous Speech Recognition

Efficient Embedded Speech Recognition for Very Large Vocabulary Mandarin Car-Navigation Systems

Speech recognition system on chip based on 5507 DSP

High Performance Mandarin Digit Recognition System on a DSP Chip