Effective linguistic steganography detection

Chen Zhi-Li,Liu-sheng Huang,Yu Zhen-Shan,Zhao Xin-Xin,Zheng Xue-ling
DOI: https://doi.org/10.1109/CIT.2008.Workshops.69
2008-01-01
Abstract:Linguistic steganography is an art of concealing secret messages. More specifically, it takes advantage of the properties of natural language, such as the linguistic structure to hide messages. In this paper, an effective method for linguistic steganography detection is presented. In virtue of the concepts in area of information theory, the method uses an information-entropy-like statistical variable of words in detected text segment together with its variance as two classification features. The Support Vector Machine is used as classifier. The method was centered on detection for small size text segments estimated in the hundreds in words. Its achievement is simple and its execution is fast and relatively accurate. In our experiment of detecting the three different linguistic steganography methods: NICETEXT, TEXTO and Markov-Chain-Based, the accuracy exceeds 90%. As a result, our method can be used as a common pre-detection method followed by a more specific and accurate detection method. © 2008 IEEE. DOI 10.1109/CIT.2008.Workshops.69.
What problem does this paper attempt to address?