Diffusion Brush: A Latent Diffusion Model-based Editing Tool for AI-generated Images

Peyman Gholami,Robert Xiao
2023-10-27
Abstract:Text-to-image generative models have made remarkable advancements in generating high-quality images. However, generated images often contain undesirable artifacts or other errors due to model limitations. Existing techniques to fine-tune generated images are time-consuming (manual editing), produce poorly-integrated results (inpainting), or result in unexpected changes across the entire image (variation selection and prompt fine-tuning). In this work, we present Diffusion Brush, a Latent Diffusion Model-based (LDM) tool to efficiently fine-tune desired regions within an AI-synthesized image. Our method introduces new random noise patterns at targeted regions during the reverse diffusion process, enabling the model to efficiently make changes to the specified regions while preserving the original context for the rest of the image. We evaluate our method's usability and effectiveness through a user study with artists, comparing our technique against other state-of-the-art image inpainting techniques and editing software for fine-tuning AI-generated imagery.
Computer Vision and Pattern Recognition,Artificial Intelligence,Computation and Language,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the challenges encountered in making local fine adjustments in AI-generated images. Specifically: 1. **Problems with existing image editing techniques**: - **Manual editing**: Time-consuming and requires a high level of technical skill. - **Inpainting**: The generated results have poor integration with the original image, often leading to unnatural effects. - **Global changes**: When fine-tuning through text prompts or selecting variants, it may cause unexpected changes to the entire image. 2. **Objective**: - Propose a tool based on the Latent Diffusion Model (LDM) — Diffusion Brush, which can efficiently make local adjustments in AI-generated images while preserving the original context of other parts of the image. 3. **Method**: - **Core mechanism**: Introduce new random noise patterns to the specified area during the reverse diffusion process and combine them with the latent representation of the original image to achieve precise modifications to the specified area. - **User interface**: Provide a user-friendly interface that allows users to create and configure multiple editing masks, control noise intensity, and the steps of noise introduction. 4. **Evaluation**: - Through user studies, compare Diffusion Brush with other common image repair and editing tools (such as Adobe Photoshop's content-aware fill and inpainting methods) in terms of user experience, efficiency, and effectiveness. Overall, the paper aims to improve the editing experience of AI-generated images by providing an efficient, easy-to-use tool capable of making local fine adjustments through Diffusion Brush.