Sliding Window Smoothing for Maximum Entropy Based Intonational Phrase Prediction in Chinese
JF Li,GP Hu,RH Wang,LR Dai
DOI: https://doi.org/10.1109/icassp.2005.1415106
2005-01-01
Abstract:In Chinese TTS (Text-To-Speech) system, Intonational phrase prediction has great influence on naturalness of synthesized speech. Different kinds of statistic models have been applied to this domain, and achieved good performance. In this paper, we first build a maximum entropy model to yield the probability of each word boundary to be an intonational phrase break, and then a sliding window smoothing algorithm is proposed, in which the length distribution curve of intonational phrase acts as the sliding window. The maximum entropy model and the distribution curve are trained from 19,000 sentences and tested on a test set of 1,000 sentences. Experiment results shows that, the sliding window smoothing algorithm makes an improvement of 5.3% in terms of F-Score, 10.0% in terms of average score, and 55.6% in tern-is of unacceptable rate. From the results, we draw the conclusion that the length distribution information is of great usefulness for intonational phrase break prediction,. and the sliding window smoothing method is quite effective to improve the performance significantly.