Text Split Upon Space Silence Tag Insertion Letter To Unicode Transformation AssameseTamil Gujarati Pause after SWord Pause at the End Pause in punctuation Label Generation Context information For Tree-Based Clustering Letter Sets Text Tegulu Rajasthan

Ling-Hui Chen,Zhen-Hua Ling,Yi-Qing Zu,Run-Qiang Yan,Yuan Jiang,Xian-Jun Xia,Ying Wang
2014-01-01
Abstract:This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2014. Six Indian languages were evaluated this year, including Assamese, Gujarati, Hindi, Rajasthani, Tamil and Telugu. Two tasks were built for these languages: the mono-lingual task (IH1 hub task) and the multi-lingual task (IH2 spoken task). We submitted entries to both tasks in all languages. We submitted two entries for evaluation: the primary entry and the secondary entry. In our primary entry, a hidden Markov model (HMM)-based unit selection system was built for Hindi language and HMM-based parametric speech synthesis systems were built for the remaining five languages. In the secondary entry, only an HMM-based parametric speech synthesis system was built for Hindi language. The evaluation results show the effectiveness of our submitted systems.
What problem does this paper attempt to address?