Unit Feature Based Pruning of Large-Scale Speech Corpus Using Decision Tree

Zhe Zhang, Lixing Huang, Jianhua Tao
DOI: https://doi.org/10.1109/icosp.2008.4697231
2008-01-01
Abstract: In this paper, we propose and implement a corpus pruning method based on a decision tree. During clustering, instead of the conventional method, we measure the distance between pitch contours using a feature vector composed of slope means. Subjective and objective evaluations show that speech synthesized from the corpus pruned by our method is close to speech synthesized from the unpruned corpus, and is superior to speech produced with the conventional pruning method at the same storage size.
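The abstract does not say how the slope-mean feature vector is built, so the following is only a minimal sketch of one plausible reading. It assumes a pitch contour given as equally spaced F0 samples, split into a fixed number of segments, with the mean frame-to-frame slope of each segment as one feature and Euclidean distance between feature vectors. The function names, the segment count, and the choice of distance are all hypothetical and are not taken from the paper.

```python
from math import sqrt

def slope_mean_features(f0, n_segments=3):
    """Return the mean frame-to-frame slope of each of n_segments pieces.

    Assumption: f0 is a list of equally spaced F0 values (e.g. in Hz).
    """
    if len(f0) < n_segments + 1:
        raise ValueError("contour too short for the requested number of segments")
    # Frame-to-frame slopes (difference between consecutive samples).
    slopes = [b - a for a, b in zip(f0, f0[1:])]
    n = len(slopes)
    # Segment boundaries; each segment holds at least one slope value.
    bounds = [i * n // n_segments for i in range(n_segments + 1)]
    return [sum(slopes[s:e]) / (e - s) for s, e in zip(bounds, bounds[1:])]

def contour_distance(f0_a, f0_b, n_segments=3):
    """Euclidean distance between the slope-mean vectors of two contours."""
    fa = slope_mean_features(f0_a, n_segments)
    fb = slope_mean_features(f0_b, n_segments)
    return sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))

rising = [100, 110, 120, 130, 140, 150, 160]
falling = list(reversed(rising))
shifted = [x + 50 for x in rising]

# Same shape at a different pitch level: slope features are identical.
print(contour_distance(rising, shifted))
# Opposite shapes: slope features differ in every segment.
print(contour_distance(rising, falling))
```

Under these assumptions the distance depends only on contour shape, not on absolute pitch level, which is one reason a slope-based feature might be preferred when clustering units for pruning.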