Minimum Generation Error Training for HMM-Based Speech Synthesis

Yi-Jian Wu,Ren-Hua Wang
DOI: https://doi.org/10.1109/icassp.2006.1659964
2006-01-01
Abstract:In HMM-based speech synthesis, there are two issues critical related to the MLE-based HMM training: the inconsistency between training and synthesis, and the lack of mutual constraints between static and dynamic features. In this paper, we propose minimum generation error (MGE) based HMM training method to solve these two issues. In this method, an appropriate generation error is defined, and the HMM parameters are optimized by using the generalized probabilistic descent (GPD) algorithm, with the aims to minimize the generation errors. From the experimental results, the generation errors were reduced after the MGE-based HMM training, and the quality of synthetic speech is improved
What problem does this paper attempt to address?