TH-CoSS, a Mandarin Speech Corpus for TTS

CAI Lian-hong, CUI Dan-dan, CAI Rui
2007-01-01
Abstract: This paper describes the construction and analysis of TH-CoSS, a speech corpus for Mandarin text-to-speech (TTS) systems. The text script consists of four parts: sentences for TTS system building, sentences for TTS system evaluation, special syllable groups, and sentences of special types that convey special intonation. The finished corpus contains about 20K sentences read by one female and one male speaker. The annotation files are in XML format and include segmental and prosodic tags. Software tools have been developed as well. Based on the syllables in TH-CoSS, the influence of context features on speech prosody is analyzed.