Diffusion-Stego: Training-free Diffusion Generative Steganography via Message Projection

Daegyu Kim, Chaehun Shin, Jooyoung Choi, Dahuin Jung, Sungroh Yoon
2023-05-30
Abstract: Generative steganography is the process of hiding secret messages in generated images instead of cover images. Existing studies on generative steganography use GAN or Flow models to obtain high hiding message capacity and anti-detection ability over cover images. However, they create relatively unrealistic stego images because of the inherent limitations of generative models. We propose Diffusion-Stego, a generative steganography approach based on diffusion models which outperform other generative models in image generation. Diffusion-Stego projects secret messages into latent noise of diffusion models and generates stego images with an iterative denoising process. Since the naive hiding of secret messages into noise boosts visual degradation and decreases extracted message accuracy, we introduce message projection, which hides messages into noise space while addressing these issues. We suggest three options for message projection to adjust the trade-off between extracted message accuracy, anti-detection ability, and image quality. Diffusion-Stego is a training-free approach, so we can apply it to pre-trained diffusion models which generate high-quality images, or even large-scale text-to-image models, such as Stable Diffusion. Diffusion-Stego achieved a high capacity of messages (3.0 bpp of binary messages with 98% accuracy, and 6.0 bpp with 90% accuracy) as well as high quality (with a FID score of 2.77 for 1.0 bpp on the FFHQ 64$\times$64 dataset) that makes it challenging to distinguish from real images in the PNG format.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses how to increase hidden-message capacity and image quality in generative steganography while maintaining message-extraction accuracy. Existing generative steganography methods offer relatively high message-hiding capacity and anti-detection ability, but the inherent limitations of the generative models they rely on (such as GAN or Flow models) often make the generated steganographic images unrealistic and therefore easy for third parties to recognize. To address this, the paper proposes Diffusion-Stego, a generative steganography method based on diffusion models.

### Main Problems and Solutions

1. **Insufficient Image Quality**:
   - **Problem**: Because of the limitations of the underlying generative models, the steganographic images produced by existing generative steganography methods are not realistic enough and are easily detected.
   - **Solution**: The paper generates steganographic images with a diffusion model. Diffusion models excel at image generation and produce high-quality images, which improves the realism of the steganographic images.
2. **Decrease in Message Extraction Accuracy**:
   - **Problem**: Hiding messages directly in the noise causes visual degradation and lowers message-extraction accuracy.
   - **Solution**: The paper introduces message projection, which controls how the message is mapped into the noise space in order to limit both the visual degradation and the loss of extraction accuracy. It provides three projection options that trade off message-extraction accuracy, anti-detection ability, and image quality.
3. **Large-Capacity Message Hiding**:
   - **Problem**: Traditional message-hiding methods can hide only a small number of message bits in each channel, which limits the total hiding capacity.
   - **Solution**: The paper proposes a multi-bit projection method that hides multiple message bits in each channel, thereby achieving a higher message-hiding capacity.

### Method Overview

- **Diffusion-Stego Framework**:
  - The sender uses a pre-trained diffusion model: the secret message is projected into Gaussian noise, and the diffusion model generates a steganographic image from that noise.
  - The receiver uses the same diffusion model to recover the hidden noise from the image and restores the original message by reversing the projection.
- **Message Projection**:
  - **MN Projection**: Maps the message onto the Gaussian noise distribution to preserve the quality of the generated image.
  - **MB Projection**: Keeps message values as far from the decision boundary as possible to improve message-extraction accuracy.
  - **MC Projection**: A compromise between MN and MB projection that preserves image quality while improving message-extraction accuracy.
- **Multi-Bit Projection**:
  - Hides multiple message bits in each channel to achieve a higher message-hiding capacity.

### Experimental Results

- **Comparison with Baseline Methods**:
  - On the FFHQ 64×64 dataset, Diffusion-Stego shows higher message-extraction accuracy and better image quality at message-hiding capacities of 1.0 bpp, 2.0 bpp, and 3.0 bpp.
  - Compared with GSN and S2IRT, Diffusion-Stego shows stronger robustness and higher extraction accuracy when hiding high-capacity messages (such as above 3.0 bpp).
- **Ablation Study**:
  - A comparison of the message projection methods shows that MB projection performs best for high-capacity message hiding, while MC projection offers a good balance between image quality and message-extraction accuracy.
- **High-Capacity Message Hiding**:
  - At message-hiding capacities of 4.0 bpp, 5.0 bpp, and 6.0 bpp, Diffusion-Stego can still generate high-quality steganographic images and maintain high message-extraction accuracy.
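The trade-off between MN and MB projection described in the Method Overview can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the paper's implementation: it assumes one bit per noise element, encoded in the element's sign with 0 as the decision boundary, and it replaces the diffusion model's noise recovery with a simple additive Gaussian perturbation. The names `project_bits`, `extract_bits`, `simulate`, the mode strings `"mn"` and `"mb"`, and the parameter `error_std` are all hypothetical. MC projection is not sketched because the summary does not specify it.

```python
import random


def project_bits(bits, mode, rng):
    """Map each bit to one noise value whose sign carries the bit.

    mode="mn": magnitude is |N(0, 1)|, so the values follow a standard
               Gaussian when the bits are uniformly random (assumed
               MN-like behaviour).
    mode="mb": magnitude is fixed at 1.0, so every value stays away from
               the decision boundary at 0 (assumed MB-like behaviour).
    """
    noise = []
    for b in bits:
        sign = 1.0 if b == 1 else -1.0
        if mode == "mn":
            magnitude = abs(rng.gauss(0.0, 1.0))
        elif mode == "mb":
            magnitude = 1.0
        else:
            raise ValueError(f"unknown mode: {mode}")
        noise.append(sign * magnitude)
    return noise


def extract_bits(noise):
    """Reverse the projection: positive values decode to 1, others to 0."""
    return [1 if v > 0 else 0 for v in noise]


def accuracy(sent, received):
    """Fraction of bits that were decoded correctly."""
    return sum(s == r for s, r in zip(sent, received)) / len(sent)


def simulate(mode, n_bits=12288, error_std=0.3, seed=0):
    """Project random bits, perturb the noise, and measure decoding accuracy.

    The Gaussian perturbation is only a stand-in for the error a receiver
    would incur when re-estimating the noise with the diffusion model.
    """
    rng = random.Random(seed)
    bits = [rng.randint(0, 1) for _ in range(n_bits)]
    noise = project_bits(bits, mode, rng)
    recovered = [v + rng.gauss(0.0, error_std) for v in noise]
    return accuracy(bits, extract_bits(recovered))


if __name__ == "__main__":
    for mode in ("mn", "mb"):
        print(mode, simulate(mode))
```

Under this toy perturbation the fixed-magnitude variant decodes more bits correctly because no value sits near 0, while the Gaussian-magnitude variant keeps the noise statistics closer to what a diffusion model expects as input. That mirrors the accuracy versus image-quality trade-off the summary attributes to MB and MN projection.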
### Conclusion

Diffusion-Stego advances generative steganography on three fronts: it raises message-hiding capacity, improves extraction accuracy, and generates high-quality steganographic images that are difficult for third parties to detect. The method offers a new direction for research on generative steganography.
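For a sense of scale, the capacities quoted above can be converted into payload sizes. The sketch below assumes a 64×64 RGB image (three channels) and treats bpp as bits per pixel; the helper names `payload_bits`, `bits_per_element`, and `expected_correct_bits` are hypothetical and introduced only for this illustration.

```python
def payload_bits(height, width, bpp):
    """Total number of message bits hidden in one image at the given bpp."""
    return int(height * width * bpp)


def bits_per_element(bpp, channels=3):
    """Average bits carried by each channel value (assumes RGB by default)."""
    return bpp / channels


def expected_correct_bits(height, width, bpp, accuracy):
    """Expected number of correctly extracted bits at a given accuracy."""
    return payload_bits(height, width, bpp) * accuracy


if __name__ == "__main__":
    # Capacity and accuracy pairs reported in the abstract.
    for bpp, acc in ((3.0, 0.98), (6.0, 0.90)):
        print(
            bpp,
            payload_bits(64, 64, bpp),
            bits_per_element(bpp),
            expected_correct_bits(64, 64, bpp, acc),
        )
```

Under these assumptions, 3.0 bpp corresponds to 12288 bits per image (one bit per channel value) and 6.0 bpp to 24576 bits (two bits per channel value), which is why capacities above 3.0 bpp call for hiding more than one bit in each value.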