DLOVE: A new Security Evaluation Tool for Deep Learning Based Watermarking Techniques

Sudev Kumar Padhi,Sk. Subidh Ali
2024-07-09
Abstract:Recent developments in Deep Neural Network (DNN) based watermarking techniques have shown remarkable performance. The state-of-the-art DNN-based techniques not only surpass the robustness of classical watermarking techniques but also show their robustness against many image manipulation techniques. In this paper, we performed a detailed security analysis of different DNN-based watermarking techniques. We propose a new class of attack called the Deep Learning-based OVErwriting (DLOVE) attack, which leverages adversarial machine learning and overwrites the original embedded watermark with a targeted watermark in a watermarked image. To the best of our knowledge, this attack is the first of its kind. We have considered scenarios where watermarks are used to devise and formulate an adversarial attack in white box and black box settings. To show adaptability and efficiency, we launch our DLOVE attack analysis on seven different watermarking techniques, HiDDeN, ReDMark, PIMoG, Stegastamp, Aparecium, Distortion Agostic Deep Watermarking and Hiding Images in an Image. All these techniques use different approaches to create imperceptible watermarked images. Our attack analysis on these watermarking techniques with various constraints highlights the vulnerabilities of DNN-based watermarking. Extensive experimental results validate the capabilities of DLOVE. We propose DLOVE as a benchmark security analysis tool to test the robustness of future deep learning-based watermarking techniques.
Cryptography and Security,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the security issue of the current digital watermarking technology based on deep neural networks (DNN) in copyright protection applications. Specifically, the paper focuses on how to evaluate and verify the robustness of these DNN watermarking techniques through a new attack method - Deep Learning - based OVErwriting (DLOVE). The goal of the DLOVE attack is to overwrite the original watermark with the target watermark in the watermarked image by adding carefully designed perturbations. This attack is particularly important for copyright protection in real - world scenarios because an attacker not only needs to remove the original watermark but also be able to replace it with their own watermark, thereby illegally claiming ownership of the digital content. The main contributions of the paper include: 1. **Proposing the DLOVE attack**: This is the first method based on the concept of Targeted Adversarial Machine Learning (AML) for overwriting the target watermark in watermarked images. 2. **Introducing a new attack category**: By using the knowledge available when using DNN watermarking techniques for copyright protection, a completely new attack category is proposed. 3. **Experimental verification**: Detailed experimental results are provided to verify the success rate of the DLOVE attack and demonstrate its generalization ability on different DNN watermarking techniques. Through these efforts, the paper not only reveals the security vulnerabilities of existing DNN watermarking techniques but also provides an important security evaluation tool for future watermarking technique designs.