Modelling and Decision Tree Based Prediction of Pitch Contour in IBM Mandarin Speech Synthesis System

Xiaochuan Niu, Liqin Shen, Weibin Zhu, Qin Shi
2000-01-01
Abstract: This paper presents a method for modelling pitch contours based on the hidden Markov model (HMM) states of an acoustic unit. A pair of vectors is computed from the alignment of the speech data with the acoustic unit’s HMM states. This vector pair represents the unit’s pitch contour feature, so that variants of the unit’s pitch contour can be measured and compared. Using this model, pitch contour decision trees are constructed for Mandarin phones from a single speaker’s database of continuous read speech. The trees are used in the Mandarin speech synthesis system, which is trained on the same database, to predict the pitch contour of a given phone from its phone context. The naturalness of the synthesized Mandarin speech is substantially improved.
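The abstract does not say what the two vectors contain, so the following is only an illustrative sketch of the general idea: summarising a unit's pitch contour per aligned HMM state so that two variants can be compared numerically. It assumes, purely for illustration, that the pair is (mean F0 per state, frame count per state) and that variants are compared by Euclidean distance over the F0 vector. The function names `pitch_vector_pair` and `contour_distance` are hypothetical and are not taken from the paper.

```python
from statistics import mean


def pitch_vector_pair(f0, state_of_frame, n_states):
    """Summarise one unit's pitch contour per HMM state.

    ASSUMPTION: the "pair of vectors" is taken to be
    (mean F0 per state, number of frames per state).
    The paper's actual definition may differ.
    """
    means, counts = [], []
    for s in range(n_states):
        # Collect the F0 values of the frames aligned to state s.
        frames = [f for f, st in zip(f0, state_of_frame) if st == s]
        counts.append(len(frames))
        means.append(mean(frames) if frames else 0.0)
    return means, counts


def contour_distance(pair_a, pair_b):
    """Euclidean distance between the per-state F0 vectors (assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(pair_a[0], pair_b[0])) ** 0.5


# Two made-up variants of the same unit, 6 frames aligned to 3 states.
states = [0, 0, 1, 1, 2, 2]
variant_a = pitch_vector_pair([200.0, 210.0, 220.0, 230.0, 240.0, 250.0], states, 3)
variant_b = pitch_vector_pair([203.0, 213.0, 224.0, 234.0, 240.0, 250.0], states, 3)
distance = contour_distance(variant_a, variant_b)
```

With a fixed-length representation of this kind, every variant of a phone becomes a point in the same vector space, which is what would allow a decision tree to cluster variants by phone context and predict a representative contour for each leaf.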