Ambiguity attack against text-to-image diffusion model watermarking

Zihan Yuan,Li Li,Zichi Wang,Xinpeng Zhang
DOI: https://doi.org/10.1016/j.sigpro.2024.109509
IF: 4.729
2024-04-11
Signal Processing
Abstract:In recent years, the text-to-image diffusiom models have achieved excellent performance. Among them, stable diffusion models (SDMs) have become one of the most widely used models because of their excellent performance. Scholars have proposed many model watermarking techniques to protect the copyright of the text-to-image diffusion models. In order to measure the security and potential risks of the existing text-to-image diffusion model watermarking techniques, an ambiguity attack against the text-to-image diffusion model watermarking is proposed for the first time in this paper. Specifically, we take the SDMs as an example, take advantage of the reversibility of the model watermarking and combine the ideas of adversarial examples and discrete prompt optimization to re-embed a forged watermark in the watermarked SDMs, thus confounding the watermark containing copyright information. A large number of experiments show that our ambiguity attack is effective and can make the original watermark lose its uniqueness without changing the watermarked text-to-image diffusion models.
engineering, electrical & electronic
What problem does this paper attempt to address?