Overview of the IBM Mandarin Text-to-Speech System

Dan-ning Jiang,Qin Shi,Fan-ping Meng,Zhi-wei Shuang,Xi-jun Ma,Yi Liu,Yong Qin
2006-01-01
Abstract:This paper presents overview of the IBM Mandarin Text-to-Speech system. It is a concatenative speech synthesis system with a data- driven text processing module and probability-based prosody model. The TC-STAR evaluation results showed that the system is state- of-the-art both in intelligibility and naturalness, and the synthesis speech are close to natural speech in overall quality. The system also has the capability to fast-develop new voices, languages, and Chinese dialects by using data-driven methods.
What problem does this paper attempt to address?