Multi-source Based Acoustic Model for Speech Synthesis.

JH Tao,YG Kang
DOI: https://doi.org/10.1109/icosp.2004.1452740
2004-01-01
Abstract:Traditional source-filter model has obvious limitation for speech synthesis in pitch modification due to the lack of spectrum distortion processing. To solve the problem, the paper compares spectrum features of voice source in various F0 ranges and timbres in detail, and generates multi-source (MS) based acoustic model for speech generation in various prosodies and timbres, by classifying and reconstructing voice source into different types. The model enhances the quality of speech synthesis even with strong changing of the speaking mood. It is important for future research on personalized and embedded speech synthesis system.
What problem does this paper attempt to address?