F0 Prediction Model of Speech Synthesis Based on Template and Statistical Method

Jianhua Tao
DOI: https://doi.org/10.1007/978-3-540-30120-2_63
2004-01-01
Abstract: This paper describes an F0 model for speech synthesis based on templates and a statistical method. Focusing on the notion of templates, we confirm that F0 patterns for a speech unit can be extracted from the various deformations (anamorphoses) of F0 contours in spontaneous speech. A prosody cost function and a statistical training method are then used to assign and adapt the weights for template selection in real applications. Unlike other methods, the approach can give feedback on exactly which parameters are crucial to the successful choice of patterns. A final test shows that the method generates synthesized speech with high naturalness and is also well suited to multilingual prosody processing.
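The template selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual cost function, features, or training procedure, so everything below is an assumption made for illustration: the `Template` class, the `tone` and `position` features, the weighted sum of absolute mismatches, and the hand-set weights (which the paper instead adapts by statistical training) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Template:
    """Hypothetical F0 template: a stored contour plus the context it came from."""
    name: str
    features: dict  # assumed context features, e.g. {"tone": 2.0, "position": 0.0}
    f0: list        # F0 contour samples in Hz


def prosody_cost(target, template, weights):
    """Assumed cost: weighted sum of absolute feature mismatches."""
    return sum(w * abs(target[k] - template.features[k]) for k, w in weights.items())


def select_template(target, templates, weights):
    """Pick the template with the lowest prosody cost for the target context."""
    return min(templates, key=lambda t: prosody_cost(target, t, weights))


# Toy inventory; the values are made up for the example.
templates = [
    Template("rising", {"tone": 2.0, "position": 0.0}, [180.0, 200.0, 230.0]),
    Template("falling", {"tone": 4.0, "position": 1.0}, [240.0, 210.0, 170.0]),
]

# Hand-set weights stand in for weights that would be learned from data.
weights = {"tone": 1.0, "position": 0.5}
target = {"tone": 4.0, "position": 0.0}

best = select_template(target, templates, weights)
print(best.name, best.f0)
```

Because each feature contributes a separately weighted term, inspecting the weights indicates which parameters dominate the choice of pattern, which is one plausible reading of the feedback property the abstract mentions.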