Acoustic Statistical Modeling Based Speech Synthesis Technologies

HU Yu,LING Zhenhua,WANG Renhua,DAI Lirong
DOI: https://doi.org/10.3969/j.issn.1003-0077.2011.06.016
2011-01-01
Abstract:This paper introduces acoustic statistical modeling based speech synthesis technologies.Emphasis is on the research progress contributed by USTC iFLYTEK speech laboratory,which includes: integrate articulatory features and acoustical features for improving the flexibility of acoustical parameters generation;propose a minimum generation error criterion to replace maximum likelihood for improving the synthesized speech quality;use unit selection and waveform concatenation to replace parametric synthesizer and avoid the limitation of speech quality in HMM based parametric synthesis.These innovative techniques improve the performance of speech synthesis systems in naturalness,expressiveness,flexibility and multilingual ability etc.These progresses have made speech synthesis technologies to be widely used in fields of information service of call center,human-machine speech interaction of mobile embedded devices and intelligent speech enabled electronic education systems.
What problem does this paper attempt to address?