Abstract:Recently, the intellectual properties (IP) protection of deep neural networks (DNN) has attracted serious concerns. A number of DNN copyright protection methods have been proposed. However, most of the existing DNN watermarking methods can only verify the ownership of the model after the piracy occurs, which cannot actively prevent the occurrence of the piracy and do not support users' identities management, thus can not satisfy the requirements of commercial DNN copyright management. In addition, the query modification attack which was proposed recently can invalidate most of the existing backdoor-based DNN watermarking methods. In this paper, we propose an active intellectual properties protection technique for DNN models via stealthy backdoor and users' identities authentication. For the first time, we use a set of clean images (as the watermark key samples) to embed an additional class into the DNN for ownership verification, and use the image steganography to embed users' identity information into these watermark key images. Each user will be assigned with a unique identity image for identity authentication and authorization control. Since the backdoor instances are clean images outside the dataset, the backdoor trigger is visually imperceptible and concealed. In addition, we embed the watermark by exploiting an additional class outside the main tasks, which establishes a strong connection for watermark key samples and the corresponding label. As a result, the proposed method is concealed, robust, and can resist common attacks and query modification attack. Experimental results demonstrate that, the proposed method can obtain 100% watermark accuracy and 100% fingerprint authentication success rate on Fashion-MNIST and CIFAR-10 datasets. In addition, the proposed method is demonstrated to be robust against the model fine-tuning attack, model pruning attack, and query modification attack. Compared with three existing DNN watermarking methods, the proposed method has better performance on watermark accuracy and robustness against the query modification attack.

Deep neural networks watermark via universal deep hiding and metric learning

Leveraging Unlabeled Data for Watermark Removal of Deep Neural Networks

Deep Model Intellectual Property Protection Via Deep Watermarking

Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion

Deep Neural Network Watermarking Against Model Extraction Attack

Active intellectual property protection for deep neural networks through stealthy backdoor and users’ identities authentication

Protecting the Intellectual Properties of Deep Neural Networks with an Additional Class and Steganographic Images

Persistent and Unforgeable Watermarks for Deep Neural Networks.

DeepHider: A Covert NLP Watermarking Framework Based on Multi-task Learning

Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data

Digital watermarking for deep neural networks

Protecting the Intellectual Property of Deep Neural Networks with Watermarking: The Frequency Domain Approach

On Function-Coupled Watermarks for Deep Neural Networks

Poster: On the Feasibility of Training Neural Networks with Visibly Watermarked Dataset

Embedding Watermarks into Deep Neural Networks

Removing Backdoor-Based Watermarks in Neural Networks with Limited Data.

DeepiSign-G: Generic Watermark to Stamp Hidden DNN Parameters for Self-contained Tracking

How to Prove Your Model Belongs to You: A Blind-Watermark based Framework to Protect Intellectual Property of DNN

Watermarking Neural Networks with Watermarked Images

On the Weaknesses of Backdoor-based Model Watermarking: An Information-theoretic Perspective

A Unique Identification-Oriented Black-Box Watermarking Scheme for Deep Classification Neural Networks