Constructing Scalable TTS System Based on Corpus Approach

Zhang Wei,Ling Zheng-hua,Dai Li-rong
DOI: https://doi.org/10.1109/iccis.2008.4670895
2008-01-01
Abstract:Pruning redundant synthesis instances or tailoring TTS voice font is an important issue of Corpus-based TTS. But pruning redundant synthesis instances, usually results in loss of non-uniform. In order to solve this problem, this paper proposes the concept of virtual non-uniform. According to this concept and the synthesis frequency of each instance, the algorithm named StaRp-VPA is constructed as to make up the loss of non-uniform. In experiments, the naturalness scored by MOS remains almost unchanged when less than 50% instances are pruned off, and the MOS does not severely degrade when reduction rate is above 50%.
What problem does this paper attempt to address?