Generative Adversarial Network for Text-to-Face Synthesis and Manipulation with Pretrained BERT Model

Yutong Zhou, Nobutaka Shimada
DOI: https://doi.org/10.1109/FG52635.2021.9666791
2021-01-01
Abstract: This work proposes a cyclic generative adversarial network with spatial-wise and channel-wise attention modules for text-to-face synthesis and manipulation. We then explore the pre-trained transformer-based BERT model to obtain text embeddings. Furthermore, dual-layer perceptual loss and SSIM loss are introduced to reinforce the delicate features and preserve facial identity during the manipulatio...
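The abstract names an SSIM loss but does not give its formulation. As a rough illustration only, the sketch below computes a simplified, single-window (global) SSIM between two grayscale images and turns it into a loss as 1 - SSIM. The function names `global_ssim` and `ssim_loss`, the global (non-windowed) statistics, and the constants 0.01 and 0.03 (the commonly used SSIM defaults) are assumptions for illustration, not details taken from the paper.

```python
import numpy as np


def global_ssim(x, y, data_range=1.0):
    """Simplified SSIM using whole-image statistics (no sliding window).

    Assumption: inputs are same-shaped grayscale arrays with values in
    [0, data_range]. Practical SSIM losses usually average over local windows.
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)

    # Stabilising constants with the commonly used defaults K1=0.01, K2=0.03.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2

    mu_x, mu_y = x.mean(), y.mean()
    var_x = ((x - mu_x) ** 2).mean()
    var_y = ((y - mu_y) ** 2).mean()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()

    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def ssim_loss(x, y, data_range=1.0):
    """Hypothetical loss: zero for identical images, larger as they diverge."""
    return 1.0 - global_ssim(x, y, data_range)
```

In this sketch, identical inputs give a loss of zero, while structurally inverted inputs give a negative SSIM and therefore a loss above one.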