Realistic Mouth Animation Synthesis Based on Articulatory DBN Models

LIU Pei-zhen,JIANG Dong-mei
2010-01-01
Abstract:Realistic talking face and mouth animation plays an important role in virtual reality.In traditional Multi-stream Hidden Markov Model(MSHMM)based visual speech synthesis,the constructed mouth images are vague and the detailed mouth movements are ignored,as MSHMM can't model the movements of articulatory organs in speech production.A visual speech synthesis method based on articulatory dynamic Bayesian network models(AF_AVDBN is proposesd),in which the articulatory features of lips,tongue,glottis/velum can be asynchronous within a maximum constraint to describe the speech production process more reasonably.Conditional probability distributions of the nodes are defined,and a visual feature learning algorithm is deduced based on maximum likelihood estimation theory.Speech driven mouth animation experiments show that much more clear and realistic mouth images can be obtained from AF_AVDBN,comparing with those from the MSHMM models.
What problem does this paper attempt to address?