Mandarin-English Mixed Text to Speech Based on HCSIPA

Xu Ying-jin,CAI Lian-hong
DOI: https://doi.org/10.3969/j.issn.1000-3428.2013.04.004
2013-01-01
Abstract:A Mandarin-English general phonetic alphabet?——Human Computer Speech Interaction Phonetic Alphabet(HCSIPA),in which the articulation manner and place are applied for construction criterion,is proposed in bilingual speaker based Mandarin-English mixed Text to Speech(TTS).A Mandarin-English common question set suitable for HCSIPA is constructed in Mandarin-English mix clustering.Its construction target is to increase the distinguishing between Mandarin and English in the decision tree structure.The result verifies the HCSIPA can increase the mixing with pronunciation unit of Mandarin and English,to reduce the timbre gap caused by language gap.The mixed TTS based on HCSIPA can synthesize the speech of Mandarin and English with a high quality,and the quality decline to single language synthesis caused by mix of Mandarin and English is not obvious.
What problem does this paper attempt to address?