ArtiFade: Learning to Generate High-quality Subject from Blemished Images

Shuya Yang,Shaozhe Hao,Yukang Cao,Kwan-Yee K. Wong
2024-09-06
Abstract:Subject-driven text-to-image generation has witnessed remarkable advancements in its ability to learn and capture characteristics of a subject using only a limited number of images. However, existing methods commonly rely on high-quality images for training and may struggle to generate reasonable images when the input images are blemished by artifacts. This is primarily attributed to the inadequate capability of current techniques in distinguishing subject-related features from disruptive artifacts. In this paper, we introduce ArtiFade to tackle this issue and successfully generate high-quality artifact-free images from blemished datasets. Specifically, ArtiFade exploits fine-tuning of a pre-trained text-to-image model, aiming to remove artifacts. The elimination of artifacts is achieved by utilizing a specialized dataset that encompasses both unblemished images and their corresponding blemished counterparts during fine-tuning. ArtiFade also ensures the preservation of the original generative capabilities inherent within the diffusion model, thereby enhancing the overall performance of subject-driven methods in generating high-quality and artifact-free images. We further devise evaluation benchmarks tailored for this task. Through extensive qualitative and quantitative experiments, we demonstrate the generalizability of ArtiFade in effective artifact removal under both in-distribution and out-of-distribution scenarios.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the problem that existing methods are difficult to generate high - quality images in subject - driven text - to - image generation when the input images are blemished. Specifically: 1. **Limitations of existing methods**: - Current methods usually rely on high - quality images for training. For images with blemishes (such as visible blemishes like watermarks, graffiti, stickers, and invisible blemishes like adversarial noise), these methods may not be able to generate reasonable images. - These methods lack the ability to distinguish between subject - related features and interfering blemishes, resulting in a decline in the quality of the generated images. 2. **New problems proposed**: - The paper defines a new problem: how to use blemished images for blemished subject - driven generation. This problem is very important in practical applications because obtaining blemish - free images is often very expensive or even impossible. ### Solutions To solve the above problems, the paper proposes a model named **ArtiFade**. ArtiFade mainly solves the problems in the following ways: 1. **Dataset construction**: - A special dataset is constructed, which contains blemish - free images and their corresponding blemished versions. These paired images are used to fine - tune various subject - driven generation methods. 2. **Model fine - tuning**: - ArtiFade fine - tunes the pre - trained text - to - image generation models (such as diffusion models) to align the implicit relationships between blemish - free images and blemished images. - An artifact - free embedding is introduced to enhance the fidelity of the prompts. 3. **Evaluation benchmark**: - A set of evaluation benchmarks is proposed, including multiple test sets with different blemishes and customized evaluation metrics, to accurately evaluate the performance of blemished subject - driven generation methods. 4. **Experimental verification**: - Through extensive qualitative and quantitative experiments, the effectiveness and generalization ability of ArtiFade in dealing with blemished images are proved, both in in - distribution and out - of - distribution scenarios. ### Summary The main contribution of the paper is that it solves the blemished subject - driven generation problem for the first time and proposes the ArtiFade model, which can effectively extract subject - specific information from blemished training data to generate high - quality and blemish - free images. In addition, the paper also introduces an evaluation benchmark for this task and verifies the superior performance of ArtiFade through experiments.