The USTC System for Blizzard Challenge 2008

Zhen-Hua Ling,Heng Lu,Guo-Ping Hu,Li-Rong Dai,Ren-Hua Wang,Ling-Hui Chen,Yu Hu,Li-Rong Dai,Ren-Hua Wang
2008-01-01
Abstract:This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2008. Two synthetic voices from the released UK English database are built using the HMM- based unit selection synthesis method, which is a hybrid of sta- tistical parametric synthesis and unit-selection techniques. In this method, the optimal sequence of phone-sized candidate units is selected from the database following the statistical crite- rions derived from a set of trained HMMs for different acoustic features. Then the waveforms of selected units are concatenated to generate the synthesized speech. The evaluation results of Blizzard Challenge 2008 show that our system has good per- formance on similarity, naturalness and intelligibility for both English voices. Index Terms: speech synthesis, Blizzard Challenge, unit selec- tion, hidden Markov model
What problem does this paper attempt to address?