The Pause Duration Prediction for Mandarin Text-to-speech System

J Yu,JH Tao
DOI: https://doi.org/10.1109/nlpke.2005.1598735
2005-01-01
Abstract:In this paper, we enter into detailed analysis on how the pause duration under different prosodic boundaries are affected by various contextual factors in natural speech. To get the correlation between them, the paper calculates the mean pause duration under different prosodic boundaries. The contextual factors investigated in this paper contains both linguistic features, such as boundary types, syllable tones of boundary sides, initial and final types etc, and acoustic features, such as pitch gap across the boundary. The paper makes experiments and discussion which reveals the influence of these factors on pause duration. Based on that, the paper creates a pause duration prediction model for Mandarin speech synthesis system. The model was proved to be able to generate high quality prosody output with the listening test.
What problem does this paper attempt to address?