Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples

Chumeng Liang,Xiaoyu Wu,Yang Hua,Jiaru Zhang,Yiming Xue,Tao Song,Zhengui Xue,Ruhui Ma,Haibing Guan
2023-06-06
Abstract:Recently, Diffusion Models (DMs) boost a wave in AI for Art yet raise new copyright concerns, where infringers benefit from using unauthorized paintings to train DMs to generate novel paintings in a similar style. To address these emerging copyright violations, in this paper, we are the first to explore and propose to utilize adversarial examples for DMs to protect human-created artworks. Specifically, we first build a theoretical framework to define and evaluate the adversarial examples for DMs. Then, based on this framework, we design a novel algorithm, named AdvDM, which exploits a Monte-Carlo estimation of adversarial examples for DMs by optimizing upon different latent variables sampled from the reverse process of DMs. Extensive experiments show that the generated adversarial examples can effectively hinder DMs from extracting their features. Therefore, our method can be a powerful tool for human artists to protect their copyright against infringers equipped with DM-based AI-for-Art applications. The code of our method is available on GitHub: <a class="link-external link-https" href="https://github.com/mist-project/mist.git" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition,Artificial Intelligence,Cryptography and Security,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to use adversarial examples to prevent copyright infringement in artistic creation by Diffusion Models (DMs). Specifically, in recent years, diffusion models have made remarkable progress in AI - generated art, but at the same time, they have also given rise to new copyright issues. Some infringers use unauthorized artworks to train diffusion models to generate new paintings with similar styles. In order to address these emerging copyright infringements, this paper, for the first time, explores and proposes a method of using adversarial examples to protect human - created artworks. ### Specific Problem Description 1. **Copyright Infringement Problem**: Diffusion models can learn from a small number of samples and imitate the unique styles of artists, which has led to some copyright infringement behaviors. For example, the textual inversion technique can quickly learn and imitate the style of a specific artist using a small number of samples, enabling infringers to illegally use these artworks to train models. 2. **Lack of Effective Protection Means**: Although artists have the right to declare that their works are prohibited from being used to train AI art applications, currently, there is a lack of effective technical means to prevent or track such illegal use, resulting in low infringement costs and difficult evidence - collection. ### Solutions 1. **Construct a Theoretical Framework**: This paper first constructs a theoretical framework to define and evaluate adversarial examples against diffusion models. These adversarial examples add tiny perturbations to images, making it impossible for diffusion models to extract image features or imitate artistic styles. 2. **Design an Algorithm**: Based on the above - mentioned theoretical framework, this paper designs a new algorithm named AdvDM. This algorithm uses the Monte Carlo method to estimate the objective function of adversarial examples and generates adversarial examples by optimizing different latent variables. 3. **Experimental Verification**: Through extensive experiments, this paper verifies that the generated adversarial examples can effectively prevent diffusion models from extracting features and imitating styles from images. ### Main Contributions 1. **Construct a New Framework**: This paper, for the first time, systematically studies the definition and evaluation methods of adversarial examples against diffusion models. 2. **Propose a New Algorithm**: Proposes the AdvDM algorithm for generating adversarial examples against diffusion models. 3. **Experimental Verification**: Through experiments on multiple datasets, verifies the effectiveness of the method in this paper, especially in protecting images from being learned, imitated, and copied by diffusion models. ### Technical Details - **Objective Function**: Define the objective function of adversarial examples to minimize the probability that adversarial examples are recognized as real images. - **Algorithm Implementation**: Use the Monte Carlo method to estimate the objective function and generate adversarial examples by the gradient - ascent method. - **Evaluation Method**: Evaluate the effect of adversarial examples by the quality of generated images, especially the quality of conditionally - generated images. ### Experimental Results - **Text - to - Image Generation**: On the LSUN and WikiArt datasets, the generated adversarial examples significantly increase the FID value and decrease the Precision value, indicating that the adversarial examples effectively protect the image content from being extracted as generation conditions. - **Style Transfer**: On the WikiArt dataset, the generated adversarial examples make the style of conditionally - generated images significantly different from that of the input images, indicating that the adversarial examples can effectively prevent illegal style transfer. - **Image - to - Image Synthesis**: On the Pexels dataset, the generated adversarial examples make the conditionally - generated images unrealistic, further verifying the effectiveness of the adversarial examples. Through these methods and techniques, this paper provides artists with an effective tool to protect their works from illegal use by AI art applications.