Automatic Phrase Boundary Labeling for a Mandarin TTS Corpus Using the Viterbi Decoding Algorithm

杨辰雨,朱立新,凌震华,戴礼荣
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2011.09.025
2011-01-01
Abstract:An automatic prosodic phrase boundary labeling method was developed for Mandarin speech synthesis corpora using the Viterbi decoding algorithm.The algorithm quickly and precisely labels the prosodic phrase boundaries of Mandarin speech,which will reduce the cost of constructing a large corpus based Mandarin unit selection speech synthesis system.The method can be divided into the model training stage and the prosody labeling stage.The training prepares a context-dependent hidden Markov model(HMM) of the spectrum,F0 and phone duration.In the labeling stage,the Viterbi decoding algorithm is used to label the prosodic phrase boundaries using the training stage models.Tests show that the system gives an F-score of 77.64% for the prosodic phrase boundary labeling.The prosodic labeling categories can be increased easily in this method.
What problem does this paper attempt to address?