Voiced/unvoiced Decision Algorithm for HMM-based Speech Synthesis

Shiyin Kang,Zhiwei Shuang,Quansheng Duan,Yong Qin,Lianhong Cai
DOI: https://doi.org/10.21437/interspeech.2009-138
2009-01-01
Abstract:This paper introduces a novel method to improve the U/V decision method in HMM-based speech synthesis. In the conventional method, the U/V decision of each state is independently made, and a state in the middle of a vowel may be decided as unvoiced. In this paper, we propose to utilize the constraints of natural speech to improve the U/V decision inside a unit, such as syllable or phone. We use a GMM-based U/V change time model to select the best U/V change time in one unit, and refine the UN decision of all states in that unit based on the selected change time. The result of a perceptual evaluation demonstrates that the proposed method can significantly improve the naturalness of the synthetic speech.
What problem does this paper attempt to address?