BLSTM Guided Unit Selection Synthesis System for Blizzard Challenge 2016

Jianhua Tao,Yibin Zheng,Zhengqi Wen,Ya Li,Biu Liu
DOI: https://doi.org/10.21437/blizzard.2016-4
2016-01-01
Abstract:The paper introduces the speech synthesis system developed by Institute of Automation, Chinese Academy of Sciences (CASIA) for Blizzard Challenge 2016. About 5 hours of speech data from professionally-produced children’s audiobooks is adopted as the training data for the construction this year. Different from our previous systems, the BLSTM guided unit selection and waveform concatenation approaches is selected to develop our speech synthesis using the provided corpus. We will describe our definitions of the acoustic, prosodic and linguistic parameters, procedure of candidate unit selection, components of cost function, etc. Finally, we will also present the results of the listening test conducted.
What problem does this paper attempt to address?