Initial-Final Based Embedded Mandarin TTS System

Zhang Wanzhi,Tao Jianhua
DOI: https://doi.org/10.3969/j.issn.1003-0530.2005.z1.055
IF: 4.729
2005-01-01
Signal Processing
Abstract:Initials and finals are introduced as the basic units for corpus design and unit selection, so a bigger compression ratio becomes potentially available. CART method is used to pre-classify initial-final units with phonological and phonetic information. An improved ISODATA clustering method is presented, in which MFCC and pitch contour are respectively considered as the measures to cluster initials and finals in the leaf nodes of CART. Articulatory trunk is introduced as the spectrum smoothing unit of the system. Listening test and validation test show that the synthesis results are much close to that of the desktop system. Keyword: Embedded TTS; Initials and finals; ISODATA clustering; Coarticulation effect; Articulatory trunk
What problem does this paper attempt to address?