Formant Speech Synthesis Based on Trainable Model

Zhiping Zhang, Xihong Wu
DOI: https://doi.org/10.4028/www.scientific.net/amm.303-306.1334
2013-01-01
Applied Mechanics and Materials
Abstract: The authors proposed a trainable formant synthesis method based on the multi-channel Hidden Trajectory Model (HTM). In this method, the phonetic targets, formant trajectories, and spectrum states from the oral, nasal, voiceless, and background channels were designed to form hierarchical hidden layers, from which spectra were generated as the observable features. In model training, the phonemic targets were learned from one hour of training speech data, and the phoneme boundaries were aligned as part of the same process. The experimental results showed that speech could be reconstructed from the trainable formant model using a source-filter synthesizer.
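
To make the final step concrete, the following is a minimal sketch of a generic source-filter formant synthesizer: an impulse-train source passed through a cascade of second-order resonators, one per formant. It is not the authors' multi-channel HTM or their synthesizer. The function names, the resonator design, and the formant frequencies and bandwidths are all assumptions chosen for illustration, and the formants are held constant here rather than following a trained trajectory.

```python
import math


def resonator_coeffs(freq_hz, bw_hz, fs):
    """Coefficients of a second-order digital resonator with unity gain at DC.

    Difference equation: y[n] = a*x[n] + b*y[n-1] + c*y[n-2].
    """
    t = 1.0 / fs
    c = -math.exp(-2.0 * math.pi * bw_hz * t)
    b = 2.0 * math.exp(-math.pi * bw_hz * t) * math.cos(2.0 * math.pi * freq_hz * t)
    a = 1.0 - b - c
    return a, b, c


def resonate(signal, freq_hz, bw_hz, fs):
    """Filter a list of samples through one formant resonator."""
    a, b, c = resonator_coeffs(freq_hz, bw_hz, fs)
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = a * x + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def impulse_train(f0_hz, duration_s, fs):
    """Voiced source: one unit impulse per pitch period (period rounded to samples)."""
    n = int(round(duration_s * fs))
    period = int(round(fs / f0_hz))
    return [1.0 if i % period == 0 else 0.0 for i in range(n)]


def synthesize_vowel(formants, f0_hz=120.0, duration_s=0.2, fs=16000):
    """Pass the source through a cascade of resonators, one per (freq, bandwidth) pair."""
    signal = impulse_train(f0_hz, duration_s, fs)
    for freq_hz, bw_hz in formants:
        signal = resonate(signal, freq_hz, bw_hz, fs)
    return signal


# Hypothetical formant values (Hz) for a vowel-like sound; not taken from the paper.
samples = synthesize_vowel([(700.0, 80.0), (1200.0, 90.0), (2600.0, 120.0)])
```

In a trainable system of the kind the abstract describes, the fixed `(freq, bandwidth)` pairs would be replaced by time-varying trajectories produced by the model, with the resonator coefficients updated frame by frame.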