ResFPA-GAN: Text-to-Image Synthesis with Generative Adversarial Network Based on Residual Block Feature Pyramid Attention.

Jingcong Sun,Yimin Zhou,Bin Zhang
DOI: https://doi.org/10.1109/arso46408.2019.8948717
2019-01-01
Abstract:Text-to-image synthesis based on generative adversarial networks (GAN) is a challenging task. The developed methods have show prominent progress on visual quality of the synthesized images, but it still face challenge in the image synthesis of details. In this paper, we introduce an image synthesis algorithm based on semantic description and propose a residual block feature pyramid attention generative adversarial network, called ResFPA-GAN. This network introduces multiscale feature fusion by embedding feature pyramid structure to achieve the fine-grained image synthesis. The quality of the image synthesis can be improved via the iterative training of GAN, while the reference of attention can enhance the network's learning of the details of image texture. Through extensive experimental comparison on the CUB dataset, our method can achieve significant improvement on the variety and authenticity for the synthesised images.
What problem does this paper attempt to address?