Syllable HMM Based Mandarin TTS and Comparison with Concatenative TTS.

Zhiwei Shuang,Shiyin Kang,Qin Shi,Yong Qin,Lianhong Cai
DOI: https://doi.org/10.21437/interspeech.2009-145
2009-01-01
Abstract:This paper introduces a Syllable HMM based Mandarin ITS system. 10-state left-to-right HMMs are used to model each syllable. We leverage the corpus and the front end of a concatenative TTS system to build the Syllable HMM based TTS system. Furthermore, we utilize the unique consonant/vowel structure of Mandarin syllable to improve the voiced/unvoiced decision of HMM states. Evaluation results show that the Syllable HMM based Mandarin TTS system with a 5.3MB's model size can achieve an overall quality close to a concatenative ITS system with 1GB' data size.
What problem does this paper attempt to address?