Prosodic Correlation Model in Text-to-Speech Synthesis

吴志勇,蔡莲红
DOI: https://doi.org/10.3969/j.issn.1003-0077.2004.02.007
2004-01-01
Abstract:In this paper, a new unit selection approach for concatenative Text-to-Speech (TTS) synthesis based on prosodic correlation model is proposed. Firstly, prosodic correlations in continuous speech are studied. Then, some prosodic parameters, including prosodic correlation parameters, are concluded. Thirdly, a prosodic correlation model (association rules model from data mining) is put into use in unit selection. The experiments show that the unit selection method described in this paper can improve the naturalness of the synthesized speech: the MOS score can achieve 12.22% higher than before (3.49/3.11).
What problem does this paper attempt to address?