Real-time speech-driven lip synchronization

Kaihui Mu,Jianhua Tao,Jianfeng Che,Minghao Yang
DOI: https://doi.org/10.1109/IUCS.2010.5666250
2010-01-01
Abstract:Speech-driven lip synchronization, an important part of facial animation, is to animate a face model to render lip movements that are synchronized with the acoustic speech signal. It has many applications in human-computer interaction. In this paper, we present a framework that systematically addresses multimodal database collection and processing and real-time speech-driven lip synchronization using collaborative filtering which is a data-driven approach used by many online retailers to recommend products. Mel-frequency cepstral coefficients (MFCCs) with their delta and acceleration coefficients and Facial Animation Parameters (FAPs) supported by MPEG-4 for the visual representation of speech are utilized as acoustic features and animation parameters respectively. The proposed system is speaker independent and real-time capable. The subjective experiments show that the proposed approach generates a natural facial animation. ©2010 IEEE.
What problem does this paper attempt to address?