A Continuous Emotional Editing Model for Talking Head Videos Based on Decoupling Texture and Geometry

Tian LV,Yu-Hui WEN,Zhiyao SUN,Yong-Jin LIU
DOI: https://doi.org/10.1360/ssi-2022-0444
2023-01-01
Abstract:The emotional editing of talking head videos is a popular research topic in computer vision and computer graphics that aims to convert a person's talking video with neutral emotion into another talking video with a target emotion.Current methods cannot simultaneously consider high-resolution emotional editing,the maintenance of the 3D property of a human face,and adaptability for different persons.To address this problem,we propose the BFM(Basel face model)conditioned shape editing network as our shape-emotion editing module,which guarantees the feasibility of geometric editing in multiperson conditions.Furthermore,we propose the subject-classifier-based textural emotional editing module,which preserves high-fidelity facial texture in multiperson tasks.Our proposed method breaks the limitations of the previous emotional editing methods,which can only be applied to a specific person or cannot generate high-resolution results in multiperson conditions.The experiment shows that our model can achieve better clarity,identity preservation,and editing quality than previous multiperson emotional editing methods and can obtain a reasonable result on an unseen person and even an unseen head pose.Meanwhile,the experiment shows that our model can continuously control the intensity of emotional editing.
What problem does this paper attempt to address?