An unvoiced/voiced duration adjustment algorithm based on context features in mandarin TTS

徐英进,王永鑫,蔡莲红
DOI: https://doi.org/10.3969/j.issn.2095-2783.2012.10.009
2012-01-01
Abstract:In Mandarin TTS,the duration of unvoiced and voiced phonemes in a syllable is a very important factor related to the naturalness of synthesized speech.We propose an unvoiced/voiced duration adjustment algorithm based on context features for HMM-based Mandarin TTS.In the algorithm,the relative duration of the unvoiced part in a syllable is clustered with context features.During the synthesis,a reference relative duration of the unvoiced part is generated from the decision tree,and the duration of the unvoiced part and voiced part in the synthesized speech is adjusted accordingly.Experiments show that this algorithm can improve the accuracy of duration prediction in HMM-based Mandarin TTS,and can effectively improve the naturalness of synthesized speech.
What problem does this paper attempt to address?