Robust Message Embedding via Attention Flow-Based Steganography

Huayuan Ye,Shenzhuo Zhang,Shiqi Jiang,Jing Liao,Shuhang Gu,Dejun Zheng,Changbo Wang,Chenhui Li
2024-11-23
Abstract:Image steganography can hide information in a host image and obtain a stego image that is perceptually indistinguishable from the original one. This technique has tremendous potential in scenarios like copyright protection, information retrospection, etc. Some previous studies have proposed to enhance the robustness of the methods against image disturbances to increase their applicability. However, they generally cannot achieve a satisfying balance between the steganography quality and robustness. Instead of image-in-image steganography, we focus on the issue of message-in-image embedding that is robust to various real-world image distortions. This task aims to embed information into a natural image and the decoding result is required to be completely accurate, which increases the difficulty of data concealing and revealing. Inspired by the recent developments in transformer-based vision models, we discover that the tokenized representation of image is naturally suitable for steganography task. In this paper, we propose a novel message embedding framework, called Robust Message Steganography (RMSteg), which is competent to hide message via QR Code in a host image based on an normalizing flow-based model. The stego image derived by our method has imperceptible changes and the encoded message can be accurately restored even if the image is printed out and photoed. To our best knowledge, this is the first work that integrates the advantages of transformer models into normalizing flow. Our experiment result shows that RMSteg has great potential in robust and high-quality message embedding.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve robustness against various real - world image distortions when embedding information in images, while maintaining high - quality steganographic images and high - precision decoding results. Specifically, the author focuses on the message - in - image embedding problem, that is, how to embed information in natural images and requires completely accurate decoding results. This task increases the difficulty of data hiding and revealing, especially when facing real - world distortions (such as printing and photographing). The main contributions of the paper include: 1. Using QR codes as message carriers and proposing a conversion scheme to transform QR codes according to the characteristics of the host image before steganography, in order to improve the quality of steganographic images while maintaining decoding accuracy. 2. Proposing an Invertible Token Fusion (ITF) module, which effectively improves the steganographic quality by introducing a small learning matrix. 3. Proposing an AttnFlow, a steganographic network based on normalizing flows, which integrates the tokenized representation of images. This network can generate steganographic images with significantly higher quality and can survive extreme distortions. Experiments have verified the effectiveness of this method, especially in the cases of printing and photographing. By combining the advantages of the Transformer model and the normalizing flow technology, the paper solves the problem that existing methods are difficult to balance between quality and robustness, and realizes high - quality, high - capacity and robust message - image steganography.