Specialized discriminators for style consistency in facial expression synthesis
Yaxin Li,Xiangjiu Che,Quanle Liu,Yan Wang
DOI: https://doi.org/10.1007/s11042-023-17994-z
IF: 2.577
2024-01-25
Multimedia Tools and Applications
Abstract:Facial expression image generation has broad applications in fields such as entertainment, AI security, image restoration, and dataset expansion. The conditional generative adversarial network (CGAN)-based generation methods have made significant progress. However, existing methods are incapable of simultaneously providing a wide range of expression categories and high-resolution facial expressions. This paper proposes a new method for generating facial expression images using semantic label maps and original images as inputs. The facial contour map is used as a semantic label map to control the information of the target expression, solving the problem of limited expression categories. In addition, this paper proposes style and identity discriminators to learn the style and identity information of the original image. To improve the image resolution, we propose a cascaded upsampling network with residual modules. Extensive experiments show that the proposed method can generate high-resolution facial images of any expression category and accurately learn the style and identity information of facial images, outperforming state-of-the-art methods
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering