Abstract: Existing VoIP steganalysis methods struggle to detect covert information in speech signals at low embedding rates, and most cannot run in real time, which limits their practical value for maintaining cyberspace security. To address both problems, this paper combines a sliding-window detection algorithm with a convolutional neural network (CNN) and proposes a real-time VoIP steganalysis method based on multi-channel convolutional sliding windows. To analyze the correlations between frames and their different neighboring frames in a VoIP signal, we define multi-channel sliding detection windows. Within each sliding window, we design two feature extraction channels, each containing multiple convolutional layers with multiple convolution kernels per layer, to extract correlation features from the input signal. A feed-forward fully connected network then fuses the extracted features. Finally, by analyzing the statistical distribution of these features, the discriminator determines whether the input speech signal contains covert information or not. We designed several experiments to test the proposed model's detection ability under various conditions, including different embedding rates and different speech lengths. The experimental results show that the proposed model outperforms all previous methods, with state-of-the-art performance especially at low embedding rates. We also tested the detection efficiency of the proposed model, and the results show that it achieves almost real-time detection of VoIP speech signals.
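The pipeline described in the abstract (sliding windows, two convolutional channels, fully connected fusion, discriminator) can be illustrated with a minimal, untrained sketch in plain Python. This is not the paper's implementation. All function names, the frame layout (a short vector of codeword values per frame), the single convolutional layer per channel, the max-pooling step, and the hand-picked weights are assumptions made for illustration only.

```python
import math
from typing import Iterator, List, Sequence

Frame = Sequence[float]              # assumed: one frame = a few codeword values
Kernel = Sequence[Sequence[float]]   # kernel[j][d]: weight for dim d of the j-th frame


def sliding_windows(frames: Sequence[Frame], window: int, stride: int) -> Iterator[Sequence[Frame]]:
    """Yield consecutive windows of `window` frames, advancing by `stride` frames."""
    for start in range(0, len(frames) - window + 1, stride):
        yield frames[start:start + window]


def conv1d_over_frames(window: Sequence[Frame], kernel: Kernel) -> List[float]:
    """Valid 1-D convolution along the frame axis followed by ReLU.

    Assumes the window holds at least as many frames as the kernel spans.
    """
    k = len(kernel)
    out = []
    for i in range(len(window) - k + 1):
        s = sum(kernel[j][d] * window[i + j][d]
                for j in range(k) for d in range(len(kernel[j])))
        out.append(max(0.0, s))
    return out


def channel_feature(window: Sequence[Frame], kernels: Sequence[Kernel]) -> List[float]:
    """One feature-extraction channel: apply every kernel, then max-pool over time.

    The paper stacks several convolutional layers per channel; one layer is
    used here to keep the sketch short (a simplification, not the paper's design).
    """
    return [max(conv1d_over_frames(window, kernel)) for kernel in kernels]


def detect(window: Sequence[Frame],
           narrow_kernels: Sequence[Kernel],
           wide_kernels: Sequence[Kernel],
           fusion_weights: Sequence[float],
           bias: float) -> float:
    """Fuse both channels with a single linear layer and return a sigmoid score."""
    features = channel_feature(window, narrow_kernels) + channel_feature(window, wide_kernels)
    logit = sum(w * f for w, f in zip(fusion_weights, features)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


if __name__ == "__main__":
    # Hypothetical stream of six frames with three codeword values each.
    stream = [[float(i), 1.0, 0.0] for i in range(6)]
    narrow = [[[1.0, 0.0, 0.0]]]                    # kernel spanning 1 frame
    wide = [[[-1.0, 0.0, 0.0], [1.0, 0.0, 0.0]]]    # kernel spanning 2 adjacent frames
    for w in sliding_windows(stream, window=4, stride=2):
        print(detect(w, narrow, wide, fusion_weights=[1.0, -3.0], bias=0.0))
```

The two channels differ only in how many adjacent frames each kernel spans, which is one plausible reading of "different neighborhood frames". In a real detector the kernels and fusion weights would be learned from labeled cover and stego speech, and each window would be scored as soon as its last frame arrives, which is what makes a sliding-window design suitable for streaming detection.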

Steganalysis of VoIP Streams with CNN-LSTM Network.

Hierarchical Representation Network for Steganalysis of QIM Steganography in Low-Bit-Rate Speech Signals

RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network

Detection of QIM-Based Steganography in VoIP Streams: A MobileViT-Inspired Model

Practical Deep Learning Models for QIM-based VoIP Steganalysis

Frame-level steganalysis of QIM steganography in compressed speech based on multi-dimensional perspective of codeword correlations

A co-occurrence matrix based approach to detect jpeg steganography

Real-time Steganalysis for Streaming Media Based on Multi-Channel Convolutional Sliding Windows

Fast Steganalysis Method for VoIP Streams

Efficient Streaming Voice Steganalysis in Challenging Detection Scenarios

TENet: Leveraging Transformer Encoders for Steganalysis of QIM Steganography in VoIP Speech Streams

Real-Time Steganalysis for Stream Media Based on Multi-channel Convolutional Sliding Windows

FCEM: A Novel Fast Correlation Extract Model for Real Time Steganalysis of VoIP Stream Via Multi-Head Attention

Detection of Heterogeneous Parallel Steganography for Low Bit-Rate VoIP Speech Streams.

SSLSS: Semi-Supervised Learning-based Steganalysis Scheme for Instant Voice Communication Network

STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams

SPM: estimating payload locations of QIM-based steganography in low-bit-rate compressed speeches

Text Steganalysis with Attentional LSTM-CNN

Efficient Blind Steganalysis Algorithm for QIM Encoding

Maximizing steganalysis performance using siamese networks for image

Detection of QIM steganography in G.729A encoded speech stream based on LPC filter sequence analysis