USTC System for Blizzard Challenge 2006 an Improved HMM-based Speech Synthesis Method

Zhen-Hua Ling,Yi-Jian Wu,Yu-Ping Wang,Long Qin,Ren-Hua Wang
DOI: https://doi.org/10.21437/blizzard.2006-6
2006-01-01
Abstract:This paper introduces the USTC speech synthesis system for Blizzard Challenge 2006. The HMM-based parametric synthesis approach was adopted for its convenience and effectiveness in building a new voice, especially for the nonnative developers. Some useful techniques were also integrated into our system, such as minimum generation error (MGE) training, phone duration modeling and linear spectral pair (LSP) based formant enhancement. The evaluation results show that the proposed system is able to synthesize speech with high naturalness and intelligibility by using either full database or only ARCTIC subset.
What problem does this paper attempt to address?