Rearch on Prosodic Hierarchy Conversion for Uyghur Emotional Speech

DU Nannan,ZHAO Hui
DOI: https://doi.org/10.3778/j.issn.1002-8331.1604-0009
2016-01-01
Abstract:A prosody conversion method is proposed for transforming neutral speech to emotional speech of Uyghur. The method uses the Discrete Cosine Transform(DCT)to parameterize the emotion fundamental frequency of the Uyghur syl-lables and prosodic phrases for the first time, which combining the Uyghur prosodic features and language features. Using the Gaussian Mixture Model(GMM)to train the joint characteristics of the neutral and emotional frequency, and then syn-thesize emotional speech with neutral speed and emotional speed. The listening test results show that emotional speed is more helpful to express the emotional speech. The objective evaluation and the listening test results show that method can actualize Uyghur emotional prosody conversion effectively, the conversion results of syllables and prosodic phrases of three emotions achieve accuracy of more than 75%in listening test, and the prosodic phrases is better than that of syllables.
What problem does this paper attempt to address?