DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model

Liangqi Lei,Keke Gai,Jing Yu,Liehuang Zhu
2024-05-04
Abstract:Latent Diffusion Models (LDMs) enable a wide range of applications but raise ethical concerns regarding illegal utilization.Adding watermarks to generative model outputs is a vital technique employed for copyright tracking and mitigating potential risks associated with AI-generated content. However, post-hoc watermarking techniques are susceptible to evasion. Existing watermarking methods for LDMs can only embed fixed messages. Watermark message alteration requires model retraining. The stability of the watermark is influenced by model updates and iterations. Furthermore, the current reconstruction-based watermark removal techniques utilizing variational autoencoders (VAE) and diffusion models have the capability to remove a significant portion of watermarks. Therefore, we propose a novel technique called DiffuseTrace. The goal is to embed invisible watermarks in all generated images for future detection semantically. The method establishes a unified representation of the initial latent variables and the watermark information through training an encoder-decoder model. The watermark information is embedded into the initial latent variables through the encoder and integrated into the sampling process. The watermark information is extracted by reversing the diffusion process and utilizing the decoder. DiffuseTrace does not rely on fine-tuning of the diffusion model components. The watermark is embedded into the image space semantically without compromising image quality. The encoder-decoder can be utilized as a plug-in in arbitrary diffusion models. We validate through experiments the effectiveness and flexibility of DiffuseTrace. DiffuseTrace holds an unprecedented advantage in combating the latest attacks based on variational autoencoders and Diffusion Models.
Cryptography and Security,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to embed watermarks in the output of the generation model to track copyright and mitigate potential risks associated with AI - generated content. Specifically, the existing watermarking techniques have the following problems: 1. **Post - processing watermarking techniques are easily circumvented**: These techniques are vulnerable to circumvention by attackers, especially when using variational auto - encoders (VAE) and diffusion models for image reconstruction. 2. **Fixed - message embedding requires retraining the model**: The existing watermarking methods can only embed fixed messages. If the watermark information needs to be changed, the model must be retrained. 3. **The stability of the watermark is affected by model updates**: The stability of the watermark will be affected by model updates and iterations, resulting in the watermark may not be consistent in different versions of the model. 4. **The effectiveness of existing watermark removal techniques**: The existing watermark removal techniques based on VAE and diffusion models can effectively remove most watermarks, which brings new challenges to the robustness of watermarks. To solve these problems, the paper proposes a new watermarking scheme - **DiffuseTrace**. The main goal of DiffuseTrace is to embed invisible watermarks in all generated images and be able to extract these watermarks in future detections. Specifically, DiffuseTrace has the following characteristics: - **Semantic - level embedding**: The watermark information is embedded in the initial latent variables at the semantic level without affecting image quality and semantic consistency. - **Flexible modification of watermark information**: The watermark information can be flexibly modified without retraining or fine - tuning the model. - **High robustness**: The watermark has strong robustness against various image processing techniques and state - of - the - art watermark removal methods. Through these characteristics, DiffuseTrace aims to provide a transparent, flexible and robust watermarking scheme to deal with current and future possible watermark attacks.