MAP-based Speaker Adaptation in Speech Synthesis

Zhao Huanhuan, Ling Zhenhua, Wang Renhua, Dai Lirong
DOI: https://doi.org/10.3969/j.issn.1004-9037.2010.04.014
2010-01-01
Abstract: In HMM-based speaker adaptation for speech synthesis, the commonly used maximum likelihood linear regression (MLLR) algorithm still produces speech that differs from the natural voice in timbre and speaker similarity. To improve speaker adaptation, an adaptation method based on structural maximum a posteriori (SMAP) estimation, drawn from speech recognition theory, is presented for speech synthesis. Experiments with MLLR, maximum a posteriori (MAP), and SMAP adaptation, covering parameter settings and data selection, are used to discuss how to improve the effect of speaker adaptation. The experiments show that the method provides a more effective adaptation approach than conventional MLLR in terms of timbre and similarity.
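As an illustration of the MAP idea named in the abstract, the following minimal sketch shows the standard textbook MAP update of a single Gaussian mean, which interpolates between a prior (speaker-independent) mean and the adaptation data. It is not taken from the paper and does not implement SMAP or MLLR; the function name `map_adapt_mean`, the prior weight `tau`, and the toy data are assumptions made for illustration only.

```python
def map_adapt_mean(prior_mean, frames, posteriors, tau=10.0):
    """Textbook MAP update of one Gaussian mean (hypothetical helper).

    prior_mean : list of floats, the speaker-independent mean
    frames     : list of observation vectors from the target speaker
    posteriors : occupancy probability of this Gaussian for each frame
    tau        : assumed prior weight; larger values trust the prior more
    """
    dim = len(prior_mean)
    occupancy = sum(posteriors)  # total soft count of adaptation frames
    weighted_sum = [0.0] * dim
    for frame, gamma in zip(frames, posteriors):
        for d in range(dim):
            weighted_sum[d] += gamma * frame[d]
    # Interpolate between the prior mean and the data statistics.
    return [
        (tau * prior_mean[d] + weighted_sum[d]) / (tau + occupancy)
        for d in range(dim)
    ]


# Toy usage: two frames fully assigned to this Gaussian.
adapted = map_adapt_mean([0.0, 0.0], [[1.0, 2.0], [3.0, 4.0]], [1.0, 1.0], tau=2.0)
print(adapted)
```

With no adaptation frames the update returns the prior mean unchanged, and as the amount of data grows the estimate moves toward the data average, which is why plain MAP needs a reasonable amount of target-speaker data per Gaussian.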