Abstract: Machine learning and deep learning models are potential vectors for various attack scenarios. For example, previous research has shown that malware can be hidden in deep learning models. Hiding information in a learning model can be viewed as a form of steganography. In this research, we consider the general question of the steganographic capacity of learning models. Specifically, for a wide range of models, we determine the number of low-order bits of the trained parameters that can be overwritten without adversely affecting model performance. For each model considered, we graph the accuracy as a function of the number of low-order bits that have been overwritten, and for selected models, we also analyze the steganographic capacity of individual layers. The models that we test include the classic machine learning techniques of Linear Regression (LR) and Support Vector Machine (SVM); the popular general deep learning models of Multilayer Perceptron (MLP) and Convolutional Neural Network (CNN); the highly successful Recurrent Neural Network (RNN) architecture of Long Short-Term Memory (LSTM); the pre-trained transfer learning-based models VGG16, DenseNet121, InceptionV3, and Xception; and, finally, an Auxiliary Classifier Generative Adversarial Network (ACGAN). In all cases, we find that a majority of the bits of each trained parameter can be overwritten before the accuracy degrades. Of the models tested, the steganographic capacity ranges from 7.04 KB for our LR experiments to 44.74 MB for InceptionV3. We discuss the implications of our results and consider possible avenues for further research.
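
To make the idea of overwriting low-order bits concrete, the following is a minimal sketch, not the authors' implementation. It assumes the trained parameters are stored as 32-bit IEEE 754 floats in a NumPy array; the function names `overwrite_low_bits`, `extract_low_bits`, and `capacity_bytes` are hypothetical and chosen only for illustration. Each parameter is reinterpreted as an unsigned 32-bit integer, its `n_bits` low-order bits are cleared, and one payload chunk is written in their place.

```python
import numpy as np


def overwrite_low_bits(weights: np.ndarray, n_bits: int, payload: np.ndarray) -> np.ndarray:
    """Return a copy of `weights` with the n_bits low-order bits of each
    float32 value replaced by the matching entry of `payload`.

    Assumption: `payload` holds one integer per weight, each in [0, 2**n_bits).
    """
    if not 0 <= n_bits <= 32:
        raise ValueError("n_bits must be between 0 and 32")
    # Reinterpret the float32 bit patterns as uint32 (no numeric conversion).
    raw = weights.astype(np.float32).ravel().view(np.uint32).copy()
    chunks = np.asarray(payload).astype(np.uint32).ravel()
    if chunks.size != raw.size:
        raise ValueError("payload must supply one chunk per weight")
    mask = np.uint32((1 << n_bits) - 1)
    # Clear the low-order bits, then OR in the (masked) payload chunk.
    stego = (raw & ~mask) | (chunks & mask)
    return stego.view(np.float32).reshape(weights.shape)


def extract_low_bits(weights: np.ndarray, n_bits: int) -> np.ndarray:
    """Read back the n_bits low-order bits of each float32 weight."""
    mask = np.uint32((1 << n_bits) - 1)
    return weights.astype(np.float32).ravel().view(np.uint32) & mask


def capacity_bytes(n_params: int, n_bits: int) -> float:
    """Hidden-data capacity in bytes if n_bits per parameter are overwritten."""
    return n_params * n_bits / 8
```

In an experiment of the kind described, one would presumably apply `overwrite_low_bits` for increasing values of `n_bits`, reload the modified parameters into the model, and record the accuracy at each step. For a float32 value, the 23 low-order bits form the mantissa, so overwriting up to 23 bits perturbs only the precision of a parameter and leaves its sign and exponent unchanged.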

Steganalysis of Neural Networks Based on Parameter Statistical Bias

Calibration-based Steganalysis for Neural Network Steganography

Steganalysis of Neural Networks Based on Symmetric Histogram Distribution

Steganalysis of AI Models LSB Attacks

Adversarial Examples Against Deep Neural Network based Steganalysis

On the Steganographic Capacity of Selected Learning Models

Steganography of Steganographic Networks

Towards Deep Network Steganography: From Networks to Networks

Highly Accurate End-to-end Image Steganalysis Based on Auxiliary Information and Attention Mechanism

Small-Scale Linguistic Steganalysis for Multi-Concealed Scenarios

Disarming Steganography Attacks Inside Neural Network Models

Invisible Backdoor Attacks on Deep Neural Networks via Steganography and Regularization

Neural network based steganalysis in still images

A General Steganographic Framework for Neural Network Models

A Novel Grayscale Image Steganography via Generative Adversarial Network

Deeply‐Recursive Attention Network for Video Steganography

Preemptive Image Protection against Steganography

Linguistic Steganalysis Via Densely Connected LSTM with Feature Pyramid

Exploiting Language Model for Efficient Linguistic Steganalysis

Natias: Neuron Attribution based Transferable Image Adversarial Steganography

Hiding Data in Colors: Secure and Lossless Deep Image Steganography via Conditional Invertible Neural Networks