Speech Driven Face Animation based on Dynamic Concatenation Model

Jianhua Tao,Panrong Yin
2007-01-01
Abstract:In the paper, we design and develop a speech driven face animation system based on the dynamic concatenation model. The face animation is synthesized by the unit concatenating, and synchronous with the real speech. The units are selected according to the cost functions which correspond to voice spectrum distance between training and target units. Visual distance between two adjacent training units is also used to get better mapping results. Finally, the Viterbi method is used to find out the best face animation sequence. The experimental results show that synthesized lip movement has a good and natural quality.
What problem does this paper attempt to address?