Abstract: Previous VoIP steganalysis methods struggle to detect speech signals with low embedding rates, and most cannot run in real time, which limits their practical value for maintaining cyberspace security. To address these two challenges, we combine a sliding window detection algorithm with a Convolutional Neural Network and propose a real-time VoIP steganalysis method based on multi-channel convolutional sliding windows. To analyze the correlations between a frame and its different neighboring frames in a VoIP signal, we define multi-channel sliding detection windows. Within each sliding window, we design two feature extraction channels, each containing multiple convolution layers with multiple convolution kernels per layer, to extract correlation features of the input signal. Based on these extracted features, we use a feed-forward fully connected network for feature fusion. Finally, by analyzing the statistical distribution of these features, the discriminator determines whether the input speech signal contains covert information or not. We designed several experiments to test the proposed model's detection ability under various conditions, including different embedding rates and different speech lengths. Experimental results show that the proposed model outperforms all previous methods and achieves state-of-the-art performance, especially at low embedding rates. In addition, we tested the detection efficiency of the proposed model, and the results show that it can achieve almost real-time detection of VoIP speech signals.
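The pipeline described in the abstract (sliding windows, two convolutional feature channels, fully connected fusion, a final decision score) can be illustrated with a minimal pure-Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the window size and stride, the kernel widths, the single convolution layer per channel (the paper uses several), the max pooling, the random untrained weights, and all function names.

```python
import math
import random


def sliding_windows(frames, size, stride):
    """Yield consecutive windows of `size` frames, advancing by `stride`."""
    for start in range(0, len(frames) - size + 1, stride):
        yield frames[start:start + size]


def conv1d_relu(seq, kernel):
    """Valid 1-D cross-correlation of `seq` with `kernel`, followed by ReLU."""
    k = len(kernel)
    return [max(0.0, sum(seq[i + j] * kernel[j] for j in range(k)))
            for i in range(len(seq) - k + 1)]


def channel_features(window, kernels):
    """One feature channel: each kernel contributes one max-pooled feature."""
    return [max(conv1d_relu(window, kernel)) for kernel in kernels]


def detect(window, chan_a, chan_b, fc_weights, fc_bias):
    """Fuse both channels with one dense layer; return a sigmoid score."""
    feats = channel_features(window, chan_a) + channel_features(window, chan_b)
    z = sum(w * f for w, f in zip(fc_weights, feats)) + fc_bias
    return 1.0 / (1.0 + math.exp(-z))


rng = random.Random(0)

# Hypothetical, untrained parameters: channel A uses width-3 kernels
# (near neighbors), channel B uses width-5 kernels (wider neighborhood).
chan_a = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
chan_b = [[rng.uniform(-1, 1) for _ in range(5)] for _ in range(4)]
fc_weights = [rng.uniform(-1, 1) for _ in range(8)]
fc_bias = 0.0

# Stand-in for a stream of per-frame values (for example, normalized codewords).
frames = [rng.random() for _ in range(20)]

# One score per window; a trained model would threshold these scores.
scores = [detect(w, chan_a, chan_b, fc_weights, fc_bias)
          for w in sliding_windows(frames, size=10, stride=5)]
```

Because each window is scored independently, a detector of this shape can emit a decision as soon as enough frames have arrived, which is the property that makes a sliding window design suitable for near real-time use.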
