A Tree-Based Model of Prosodic Phrasing for Chinese Text-to-Speech Systems

Weijun Chen,Fuzong Lin,Jianmin Li,Bo Zhang
DOI: https://doi.org/10.1007/3-540-45453-5_143
2001-01-01
Abstract:This paper describes a tree-based model of prosodic phrasing for Chinese text-to-speech (TTS) systems. The model uses classification and regression trees (CART) techniques to generate the decision tree automatically. We collected 559 sentences from CCTV news program and built a corresponding speech corpus uttered by a professional male announcer. The prosodic boundaries were manually marked on the recorded speech, and word identification, part-of-speech tagging and syntactic analysis were also done on the text. A decision tree was then trained on 371 sentences (of approximately 50 min length), and tested on 188 sentences (of approximately 28 min length). Features for modeling prosody are proposed, and their effectiveness is measured by interpreting the resulting tree. We achieved a success rate of about 93%.
What problem does this paper attempt to address?