Applying SFC Model for Chinese Expressive Speech Synthesis

Bu-Fan Zhang,Zhenhua Ling,Long Qin,Ren-Hua Wang
2006-01-01
Abstract:This paper presents an approach to model the pitch contour in Chinese expressive speech synthesis by using SFC (Superposition of Functional Contours) model. Some functional contours corresponding to the expressions are introduced when applying SFC for expressive speech. During implementation, both the emotion-dependent method and emotion-independent method are realized and compared. Three emotion types (neutral, happiness and sadness) and stress caused by narrow focus are studied in our experiments. The results show that the RMSE and correlation between predicted F0 and neutral one are satisfactory and the listening tests prove that the synthesized speech using proposed pitch model presents corresponding expressions as expected.
What problem does this paper attempt to address?