Statistical Model Based on Probability Frequency for Mandarin Prosodic Structure Prediction

ZHENG Min,CAI Lianhong
DOI: https://doi.org/10.3321/j.issn:1000-0054.2006.01.021
2006-01-01
Abstract:The accuracy of prosody structure prediction in text-to-speech(TTS) conversion systems is improved by a statistical model based on the probability frequency to detect the two-tier prosodic hierarchy,including prosodic words and prosodic phrases.The system fast extracts linguistic features related to the prosodic structure such as part-of-speech,lexical words,length,and position information. Then,the probability frequency for each selected feature is calculated with statistical models designed for the prosodic words and phrases.Tests show that the correct identification rates of prosodic words and phrases are improved to 90.6% and 84.6% using the statistical model.The statistical model gives 10% better performance than the decision tree or Transformation-based learning(TBL) algorithms.
What problem does this paper attempt to address?