Noise Weighting Phased Prompt Image Editing

Guowei Xu,Zihan Zhong,Chun Yuan
DOI: https://doi.org/10.1109/ijcnn60899.2024.10651075
2024-01-01
Abstract:The remarkable performance of large-scale Text-to-Image generation(TI) models is evident in their ability to produce high-quality and diverse images. However, despite advancements, the field of image editing still faces challenges. Current methods struggle to strike a balance between fidelity and powerful editing capabilities. Moreover, approaches that do not involve fine-tuning fail to produce diverse editing results. We introduce Noise Weighting Phased Prompt Image Editing (NWPP), a method that excels in powerful editing, high fidelity, and diverse results without fine-tuning. Our approach involves a two-phase generation process. The first phase employs the original prompt to guide initial image editing, ensuring a layout resembling the original image. In the second phase, a noise-weighting technique based on the Cross-Attention map minimizes the impact of the target text on non-editing regions. Further enhancement is achieved through the integration of the KV injection module, expanding the editing capabilities and enabling diverse result generation. Experimental evaluations, conducted on both generated images and the COCO dataset, affirm the efficacy of our method.
What problem does this paper attempt to address?