Data Hiding Based on Chinese Text Automatic Proofread

Wenfa Qi,Zongming Guo
DOI: https://doi.org/10.1109/iih-msp.2015.35
2015-01-01
Abstract:With the rapid development of network instant messaging technology, data hiding based on text carrier is increasingly becoming an important means of covert communication. In this paper, a new data hiding algorithm based on dynamically generated text carrier is provided. First, it uses big data collection technology to get massive targeted corpus sample. Next, text carrier is dynamically generated through natural language processing technology, and the repository of the right words and the wrong words is built. Then, the secret message is embedded through substitution of error words for candidate correct words after word segment of text carrier. Finally, it locates the pairs of error word and correct word using the Chinese text automatic proofread technology to extract the embedded secret message. Experimental results show that the proposed method has many advantages such as large embedding capacity, good concealment ability, high security level and small file size, etc. Accordingly, the proposed method can be widely applied in network covert communication.
What problem does this paper attempt to address?