Predicting Viseme Parameters from Speech Based on Neural Network

WANG Zhi-ming, CAI Lian-hong
DOI: https://doi.org/10.3969/j.issn.1000-1220.2005.06.048
2005-01-01
Abstract: Speech is produced by the cooperation of all the speech organs, so there is an inherent relation between speech and the movement of those organs. To predict viseme parameters from speech with a neural network, the selection of input speech parameters, the time domain, and the structure of the neural network were studied. Experimental results show that LPC coefficients plus short-time energy are superior to other speech parameters, that forward co-articulation is more severe than backward co-articulation, and that a delayed feedback can improve the performance of the forward neural network. Considering that the experiments were based on unlimited-vocabulary continuous speech, the mean square error (MSE) of 0.0114 is quite promising.
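To make the input representation concrete, the following is a minimal sketch of how per-frame LPC coefficients plus short-time energy could be computed. It is not the authors' implementation: the function names `lpc` and `frame_features`, the frame length, hop size, LPC order, Hamming window, and the use of log energy are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np


def lpc(frame, order):
    """LPC coefficients a[1..order] by the autocorrelation method
    (Levinson-Durbin recursion), using the convention
    s[n] ~ -(a[1]*s[n-1] + ... + a[order]*s[n-order])."""
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    if err <= 0.0:
        return a[1:]  # silent frame: return all-zero coefficients
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
        if err <= 0.0:
            break
    return a[1:]


def frame_features(signal, frame_len=256, hop=128, order=12):
    """One feature vector per frame: `order` LPC coefficients followed by
    the log short-time energy (all parameter values are assumed)."""
    window = np.hamming(frame_len)
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        energy = float(np.sum(frame ** 2))  # short-time energy
        coeffs = lpc(frame * window, order)
        feats.append(np.concatenate([coeffs, [np.log(energy + 1e-10)]]))
    return np.array(feats)
```

With these assumed settings, each frame yields a 13-dimensional vector (12 LPC coefficients and one energy term), which could then serve as the input to a network that predicts viseme parameters.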