Refining segmental boundaries for TTS database using fine contextual-dependent boundary models

Lijuan Wang, Yong Zhao, Min Chu, Jianlai Zhou, Zhigang Cao
DOI: https://doi.org/10.1109/ICASSP.2004.1326067
2004-01-01
Abstract: This paper proposes a post-refinement method for the automatic segmentation task that uses fine contextual-dependent GMMs. Each GMM is trained on a super feature vector, extracted from multiple evenly spaced frames near the boundary, to describe how the waveform evolves across that boundary. CART is used to cluster acoustically similar GMMs, so that the GMM at each leaf node can be trained reliably from the limited number of manually labeled boundaries. An accuracy of 90% is achieved when only 250 manually labeled sentences are available to train the refining models.
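The super-feature-vector idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names (`super_vector`, `gmm_log_likelihood`, `refine_boundary`), the search window around the initial boundary, the clamping at utterance edges, and the toy one-dimensional features are all assumptions made for illustration. The CART clustering step is omitted: a single hand-set GMM stands in for the model that a leaf node would supply.

```python
import math

def super_vector(frames, center, n_side=1, step=1):
    """Concatenate frames at evenly spaced offsets around `center`."""
    last = len(frames) - 1
    vec = []
    for k in range(-n_side, n_side + 1):
        # Clamp indices at the utterance edges (an assumption of this sketch).
        idx = min(max(center + k * step, 0), last)
        vec.extend(frames[idx])
    return vec

def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of x under a diagonal-covariance GMM."""
    comp = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll -= 0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        comp.append(ll)
    top = max(comp)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(c - top) for c in comp))

def refine_boundary(frames, initial, gmm, search=3, n_side=1, step=1):
    """Return the frame index near `initial` with the highest GMM score."""
    lo = max(initial - search, 0)
    hi = min(initial + search, len(frames) - 1)
    return max(
        range(lo, hi + 1),
        key=lambda t: gmm_log_likelihood(
            super_vector(frames, t, n_side, step), *gmm
        ),
    )

# Toy data: a 1-D feature that jumps from 0.0 to 1.0 at frame 5.
frames = [[0.0]] * 5 + [[1.0]] * 5
# Hypothetical boundary model: one component expecting (low, high, high).
gmm = ([1.0], [[0.0, 1.0, 1.0]], [[0.1, 0.1, 0.1]])
refined = refine_boundary(frames, initial=3, gmm=gmm)
print(refined)
```

Here the initial boundary at frame 3 is moved to the frame whose surrounding super vector best matches the boundary model. In a full system, the GMM parameters would be estimated from the manually labeled boundaries rather than set by hand.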