Emotional Speech Synthesis Based on the Modification of Prosody Parameters and Spectral Envelope

SHAO Yan-qiu,HAN Ji-qing,WANG Zhuo-ran,LIU Ting
DOI: https://doi.org/10.3969/j.issn.1003-0530.2007.04.010
IF: 4.729
2007-01-01
Signal Processing
Abstract:Emotional speech synthesis,a recently developed research subject,is expected to make the synthesized speech more expressive and human-like.Besides prosody features,voice quality and articulatory parameters are also the important factors that should be considered in emotional speech synthetic systems.Generally,rules and filters are designed to process these two kinds of parameters respectively.This paper presents that by modifying spectral envelope,the voice quality and articulatory could be adjusted as a whole. The experiments results also show that when the prosody features and spectral envelope are all modified,the best synthetic emotional speech could be got.
What problem does this paper attempt to address?