Speaker Adaptation with MAP Estimation and Weighted Neighbor Regression

HE Lei,FANG Ditang,WU Wenhu
DOI: https://doi.org/10.3321/j.issn:1000-0054.2001.01.016
2001-01-01
Abstract:This paper describes a novel speaker adaptation framework that combines the maximum a posteriori (MAP) estimation and wighted neighbor regression (WNR) methods. A great deal of adaptation data is required in MAP adaptation because only the parameters of those models with adaptation data can be updated. To alleviate this disadvantage, a technique called WNR is presented in which the parameter relationships between the speaker independent models and the speaker adaptation models are trained by applying distance weighted regression to a set of neighbor model parameters with and without MAP adaptation. The Chinese syllable recognition error is reduced nearly 15 percent with 10 adaptation utterances and more than 50 percent with 250 utterances. In addition, vector field smoothing (VFS) can be proved to be a degenerate case of WNR.
What problem does this paper attempt to address?