Abstract:A semi-continuous hidden Markov model based on the multiple vector quantization codebooks is used here for large-vocabulary speaker-independent continuous speech recognition. In the techniques employed here, the semi-continuous output probability density function for each codebook is represented by a combination of the corresponding discrete output probabilities of the hidden Markov model and the continuous Gaussian density functions of each individual codebook. Parameters of the vector quantization codebook and the hidden Markov model are mutually optimized to achieve an optimal model/codebook combination under a unified probabilistic framework. Another advantage of this approach is the enhanced robustness of the semi-continuous output probability density function by the combination of multiple codewords and multiple codebooks. For a 1000-word speaker-independent continuous speech recognition using a word-pair grammar, the recognition error rate of the semi-continuous hidden Markov model was reduced by more than 29% and 40% in comparison to the discrete and continuous mixture hidden Markov model respectively. This research was sponsored in part by the Defense Advanced Research Projects Agency under Contract N00039-85-C-0163. The views and conclusions contained in this document are those of the author and should not be interpreted as representing the official policies, either expressed or implied, of the Defense Advanced Research Projects Agency, or the US Government. X.D. Huang is a holder of an Edinburgh University Studentship and ORS Awards. Visiting scientist from CSTR, University of Edinburgh, 80, South Bridge, Edinburgh EH1 1HN, Scodand

Multiple Codebook Semi-Continuous Hidden Markov Models for Speaker-Independent Continuous Speech Recognition

Semi-continuous Segmental Probability Modeling for Continuous Speech Recognition.

Maximum Likelihood I-Vector Space Using PCA for Speaker Verification.

On the Embedded Multiple-Model Scoring Scheme for Speech Recognition

Probabilistic Speaker-Class Based Acoustic Modeling for Large Vocabulary Continuous Speech Recognition

The Hidden Markov Model of co-articulation and its application to the continuous speech recognition

Speaker recognition using continuous density support vector machines

A Novel HTS System Using both Continuous HMMs and Discrete HMMs

A Novel Hmm-Based Tts System Using Both Continuous Hmms And Discrete Hmms

FAST LIKELIHOOD COMPUTATION METHOD USING BLOCK-DIAGONAL COVARIANCE MATRICES IN HIDDEN MARKOV MODEL

A New Model for Speech Recognition : Center-Distance Continuous Probability Model

Stereo Hidden Markov Modeling for Noise Robust Speech Recognition

An Efficient Computation Algorithm In Mandarin Continuous Speech Recognition

Construction of a compact dynamic decoder network for large vocabulary continuous speech recognition

From Linear Prediction HMM to a New Combined Model for Speech Recognition

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

The Cohort-Selection And Normalized Hidden Markov Model For Speaker Recognition

Discriminative Dynamic Gaussian Mixture Selection with Enhanced Robustness and Performance for Multi-Accent Speech Recognition

Distributed Submodular Maximization for Large Vocabulary Continuous Speech Recognition

Codebook-Based Speaker Adaptation

Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition