Speech Recognition Using Speaker Adaptation by System Parameter Transformation.

Ying Hao,Ditang Fang
DOI: https://doi.org/10.1109/89.260335
1994-01-01
IEEE Transactions on Speech and Audio Processing
Abstract:Presents a speaker adaptation scheme which transforms the prototype speaker's Hidden Markov word models to those of a new speaker. Transformations are applied to both the state transition matrix and the probability distribution functions of a hidden Markov word model. These transformations are optimized through maximizing the joint probability of a set of input pronunciations of the new speaker. Details of these parameter transformations and experimental verification are presented. The test uses a 210-word vocabulary with each having a four-state Hidden Markov word model. The test speaker consists of three males and two females with one male heavily accented. By having the system retrained up to a four-minute adaptation speech, a subset of the 210-word vocabulary, the performance shows an improvement of recognition accuracy from 22.5% to 92.1%.
What problem does this paper attempt to address?