Image Attribute Modification Based on Text Guidance.

Liang Zhao,Xiaoyuan Li,Chunjiang Fu,Zhikui Chen
DOI: https://doi.org/10.1007/978-981-19-8350-4_16
2022-01-01
Abstract:The goal of this paper is to manipulate image attributes using text description. Although many methods can synthesize images with new properties from text, they cannot fully preserve the text independent content of the original image. There are two major limitations: (1) Some important details in image sub-region will be lost in the process of image modification; (2) Compared with the original image, the shape and edge of the object in the modified image will be more blurred. Therefore, we propose a novel framework edge aware generative adversarial network (EA-GAN) that uses edge information to guide image modification, which ensures the network's ability to identify local regions and realizes accurate modification of image sub-regions. At the same time, an edge reconstruction loss (ERLoss) is added to the generator to constrain the generation of edges, generate sharper edges, and improve the clarity of the image. The experimental data on the CUB and Oxford-102 datasets show that the algorithm used in this paper can well distinguish the corresponding image features in the conditional text, and modify the image attribute of specific regions in the image.
What problem does this paper attempt to address?