KT-GAN: Knowledge-Transfer Generative Adversarial Network for Text-to-Image Synthesis

Hongchen Tan,Xiuping Liu,Meng Liu,Baocai Yin,Xin Li
DOI: https://doi.org/10.1109/tip.2020.3026728
IF: 10.6
2021-01-01
IEEE Transactions on Image Processing
Abstract:This paper presents a new framework, Knowledge-Transfer Generative Adversarial Network (KT-GAN), for fine-grained text-to-image generation. We introduce two novel mechanisms: an Alternate Attention-Transfer Mechanism (AATM) and a Semantic Distillation Mechanism (SDM), to help generator better bridge the cross-domain gap between text and image. The AATM updates word attention weights and attention weights of image sub-regions alternately, to progressively highlight important word information and enrich details of synthesized images. The SDM uses the image encoder trained in the Image-to-Image task to guide training of the text encoder in the Text-to-Image task, for generating better text features and higher-quality images. With extensive experimental validation on two public datasets, our KT-GAN outperforms the baseline method significantly, and also achieves the competive results over different evaluation metrics.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?