Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models

Wenda Li, Huijie Zhang, Qing Qu
2024-10-28
Abstract: The widespread use of AI-generated content from diffusion models has raised significant concerns regarding misinformation and copyright infringement. Watermarking is a crucial technique for identifying these AI-generated images and preventing their misuse. In this paper, we introduce Shallow Diffuse, a new watermarking technique that embeds robust and invisible watermarks into diffusion model outputs. Unlike existing approaches that integrate watermarking throughout the entire diffusion sampling process, Shallow Diffuse decouples these steps by leveraging the presence of a low-dimensional subspace in the image generation process. This method ensures that a substantial portion of the watermark lies in the null space of this subspace, effectively separating it from the image generation process. Our theoretical and empirical analyses show that this decoupling strategy greatly enhances the consistency of data generation and the detectability of the watermark. Extensive experiments further validate that our Shallow Diffuse outperforms existing watermarking methods in terms of robustness and consistency. The codes will be released at https://github.com/liwd190019/Shallow-Diffuse.
Machine Learning, Cryptography and Security, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses copyright protection for AI-generated content and the prevention of its misuse. As content generated by diffusion models spreads across the Internet, three problems have become especially prominent:

1. **AI-generated misinformation**: Such content may pose a serious threat to social stability by spreading unauthorized or harmful information at scale.
2. **Memorization of training data**: Diffusion models may memorize their training data, which calls the originality of the generated content into question and can lead to copyright infringement.
3. **Model collapse**: Repeatedly training on AI-generated content may degrade output quality and reduce diversity, further exacerbating misinformation and distortion online.

To address these challenges, watermarking has become a crucial means of identifying AI-generated content and mitigating its misuse. However, existing watermarking methods have several limitations:

- **Vulnerability in user scenarios**: Traditional methods are vulnerable to attacks in user scenarios; even simple image blurring can make the watermark undetectable.
- **Inconsistency in server scenarios**: Although some existing methods improve robustness in server scenarios, they usually produce inconsistent watermarked images because they significantly change the noise distribution, so that it deviates from a Gaussian distribution.
- **Lack of flexibility**: Most existing methods apply to only one scenario (server or user) and are difficult to extend to the other.

For this reason, the paper proposes **Shallow Diffuse**, a new watermarking technique aimed at solving the above problems. Its main features are:

- **Flexibility**: It is applicable to both server scenarios and user scenarios.
- **Efficiency**: As a non-learning method, it avoids additional optimization steps and reduces the time needed for watermark injection and detection.
- **Consistency and robustness**: By decoupling the watermarking process from the sampling process, Shallow Diffuse achieves higher robustness and better consistency.
- **Theoretical guarantee**: The consistency and detectability of the method are established theoretically.

Through these innovations, Shallow Diffuse aims to provide a more robust, consistent, and efficient watermarking solution for a range of application scenarios.
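
The decoupling idea from the abstract (most of the watermark lies in the null space of a low-dimensional subspace) can be illustrated with a toy linear-algebra sketch. This is not the authors' implementation. It assumes, purely for illustration, that the generation step is locally approximated by a low-rank linear map `J`; the matrix `J`, the dimensions `d` and `r`, and the helper `null_space_projector` are all hypothetical stand-ins.

```python
import numpy as np


def null_space_projector(J, rank):
    """Projector onto the null space of a matrix J with the given rank.

    Hypothetical helper: it removes the span of the top-`rank` right
    singular vectors of J, which is the row space of J.
    """
    _, _, vt = np.linalg.svd(J, full_matrices=False)
    v = vt[:rank].T  # shape (d, rank): orthonormal basis of the row space
    return np.eye(J.shape[1]) - v @ v.T


rng = np.random.default_rng(0)
d, r = 64, 4  # assumed ambient dimension and subspace rank (illustrative)

# Assumption: a random rank-r matrix stands in for the low-rank map.
J = rng.standard_normal((d, r)) @ rng.standard_normal((r, d))

P = null_space_projector(J, r)
w = rng.standard_normal(d)  # a random watermark pattern
w_null = P @ w              # the part of w that lies in the null space of J

# Fraction of the watermark's energy in the null space (about (d - r) / d
# on average for a random w).
null_fraction = np.linalg.norm(w_null) ** 2 / np.linalg.norm(w) ** 2

# How strongly the map responds to the full watermark vs. its null-space part.
effect_raw = np.linalg.norm(J @ w)
effect_null = np.linalg.norm(J @ w_null)
```

In this toy setting, `null_fraction` is close to 1 when `r` is much smaller than `d`, and `effect_null` is zero up to floating-point error, so the null-space part of the watermark passes through without being altered by the map. Whether and how this carries over to an actual diffusion model is what the paper's theoretical and empirical analysis addresses.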