CanvasGAN: A Simple Baseline for Text to Image Generation by Incrementally Patching a Canvas

Amanpreet Singh,Sharan Agrawal
DOI: https://doi.org/10.1007/978-3-030-17795-9_7
2019-04-24
Abstract:We propose a new recurrent generative model for generating images from text captions while attending on specific parts of text captions. Our model creates images by incrementally adding patches on a “canvas” while attending on words from text caption at each timestep. Finally, the canvas is passed through an upscaling network to generate images. We also introduce a new method for generating visual-semantic sentence embeddings based on self-attention over text. We compare our model’s generated images with those generated by Reed’s model and show that our model is a stronger baseline for text to image generation tasks.
What problem does this paper attempt to address?