Abstract: With the development of natural language processing, linguistic steganography has become a research hotspot in information security. However, most existing linguistic steganographic methods suffer from low embedding capacity. This paper therefore proposes a character-level linguistic steganographic method (CLLS) that embeds secret information into characters instead of words by employing a long short-term memory (LSTM) based language model. First, the proposed method uses the LSTM model and a large-scale corpus to construct and train a character-level text generation model; the model that evaluates best during training is kept as the prediction model for generating stego text. Then, the secret information serves as control information for selecting a character from the predictions of the trained character-level text generation model. Because predicted characters with different prediction probabilities can be encoded as different secret bit values, the secret information is hidden in the generated text. For the same secret information, the generated stego text varies with the starting string given to the text generation model, so we design a selection strategy that varies the starting string to produce a number of candidate stego texts and chooses the highest-quality one as the final stego text. Experimental results demonstrate that, compared with other similar methods, the proposed method has the fastest running speed and the highest embedding capacity. Moreover, extensive experiments are conducted to verify the effect of the number of candidate stego texts on the quality of the final stego text. The results show that the quality of the final stego text increases with the number of candidate stego texts, but the rate of improvement slows as that number grows.
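The embedding idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a toy bigram frequency model stands in for the trained LSTM, the fixed-length coding of `k` bits per character is an assumed encoding (the abstract does not specify one), and the quality score (average negative log-probability with add-one smoothing) is an assumed stand-in for whatever quality measure the paper uses. All names (`train_bigram`, `embed`, `extract`, `best_stego`) are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Toy corpus; the paper trains an LSTM on a large-scale corpus instead.
CORPUS = "the quick brown fox jumps over the lazy dog and the cat sat on the mat "
ALPHABET = sorted(set(CORPUS))


def train_bigram(text):
    """Count next-character frequencies per character (stand-in for the LSTM)."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(text, text[1:]):
        counts[prev][nxt] += 1
    return counts


def ranked_candidates(model, prev):
    """All characters, most frequent successor of `prev` first; ties alphabetical."""
    freq = model[prev]
    return sorted(ALPHABET, key=lambda ch: (-freq[ch], ch))


def embed(model, bits, start, k=2):
    """Append one character per k secret bits: the bits pick the candidate's rank."""
    assert len(bits) % k == 0 and 2 ** k <= len(ALPHABET)
    text = start
    for i in range(0, len(bits), k):
        rank = int(bits[i:i + k], 2)
        text += ranked_candidates(model, text[-1])[rank]
    return text


def extract(model, stego, start_len, k=2):
    """Recover the bits by re-ranking predictions and reading each character's rank."""
    bits = ""
    for i in range(start_len, len(stego)):
        rank = ranked_candidates(model, stego[i - 1]).index(stego[i])
        bits += format(rank, f"0{k}b")
    return bits


def avg_neg_log_prob(model, text, start_len):
    """Assumed quality score (lower is better), using add-one smoothing."""
    n = len(text) - start_len
    total = 0.0
    for i in range(start_len, len(text)):
        freq = model[text[i - 1]]
        p = (freq[text[i]] + 1) / (sum(freq.values()) + len(ALPHABET))
        total -= math.log(p)
    return total / n if n else 0.0


def best_stego(model, bits, starts, k=2):
    """Generate one candidate per starting string and keep the best-scoring one."""
    candidates = [(s, embed(model, bits, s, k)) for s in starts]
    return min(candidates, key=lambda c: avg_neg_log_prob(model, c[1], len(c[0])))


model = train_bigram(CORPUS)
secret = "0110001110"
start, stego = best_stego(model, secret, ["th", "ca", "do"])
print(repr(start), repr(stego), extract(model, stego, len(start)))
```

In this sketch the receiver needs the same model, the same `k`, and the length of the starting string in order to extract the bits. Raising `k` embeds more bits per character but forces the generator to pick lower-probability characters more often, which is the capacity and quality trade-off that the candidate selection step tries to soften.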
