Acoustic statistical modeling based new generation speech synthesis technology

WANG Ren-Hua,DAI Li-rong,HU Yu,LING Zhen-hua
2008-01-01
Journal of University of Science and Technology of China
Abstract:This paper introduces acoustic statistical modeling based new generation speech synthesis technology.Emphasis is laid on the research progress in the field of new generation speech synthesis technology contributed by USTC iFlytek speech laboratory,which includes integration articulatory and acoustic features for improving the flexibility of acoustic parameter generation;a minimum generation error(MGE) criterion proposed to replace maximum likelihood for improving synthesized speech quality;use of unit selection and waveform concatenation to replace parametric synthesizer,thus effectively avoiding the limitation of speech quality in HMM based parametric synthesis.These technical innovations may further improve the performance of new generation speech synthesis technology in naturalness,expressiveness,flexibility and multilingual realization,etc.
What problem does this paper attempt to address?