Abstract:Previous VoIP steganalysis methods face great challenges in detecting speech signals at low embedding rates, and they are also generally difficult to perform real-time detection, making them hard to truly maintain cyberspace security. To solve these two challenges, in this paper, combined with the sliding window detection algorithm and Convolution Neural Network we propose a real-time VoIP steganalysis method which based on multi-channel convolution sliding windows. In order to analyze the correlations between frames and different neighborhood frames in a VoIP signal, we define multi channel sliding detection windows. Within each sliding window, we design two feature extraction channels which contain multiple convolution layers with multiple convolution kernels each layer to extract correlation features of the input signal. Then based on these extracted features, we use a forward fully connected network for feature fusion. Finally, by analyzing the statistical distribution of these features, the discriminator will determine whether the input speech signal contains covert information or <a class="link-external link-http" href="http://not.We" rel="external noopener nofollow">this http URL</a> designed several experiments to test the proposed model's detection ability under various conditions, including different embedding rates, different speech length, etc. Experimental results showed that the proposed model outperforms all the previous methods, especially in the case of low embedding rate, which showed state-of-the-art performance. In addition, we also tested the detection efficiency of the proposed model, and the results showed that it can achieve almost real-time detection of VoIP speech signals.

Detection of QIM-Based Steganography in VoIP Streams: A MobileViT-Inspired Model

Universal Methodology for Developing Quantitative Steganalysis

Steganalysis of VoIP Streams with CNN-LSTM Network.

Jpeg Quantization-Distribution Steganalytic Method Attacking Jsteg

Practical Deep Learning Models for QIM-based VoIP Steganalysis

TENet: Leveraging Transformer Encoders for Steganalysis of QIM Steganography in VoIP Speech Streams

Efficient Streaming Voice Steganalysis in Challenging Detection Scenarios

Steganalysis of model based steganography and steghide in grayscale JPEG images

RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network

A co-occurrence matrix based approach to detect jpeg steganography

Real-time Steganalysis for Streaming Media Based on Multi-Channel Convolutional Sliding Windows

SPM: estimating payload locations of QIM-based steganography in low-bit-rate compressed speeches

Fast Steganalysis Method for VoIP Streams

Real-Time Steganalysis for Stream Media Based on Multi-channel Convolutional Sliding Windows

STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams

Hierarchical Representation Network for Steganalysis of QIM Steganography in Low-Bit-Rate Speech Signals

Frame-level steganalysis of QIM steganography in compressed speech based on multi-dimensional perspective of codeword correlations

A Covert Communication Model Based on Least Significant Bits Steganography in Voice over IP

An M-Sequence Based Steganography Model for Voice over IP

Robust Message Embedding via Attention Flow-Based Steganography

Image steganalysis with convolutional vision transformer