Abstract:Steganography in inactive Voice-over-IP frames is a new technique of information hiding, which can achieve large steganographic capacity while maintaining excellent imperceptibility. To prevent the illegitimate use of this technique, the entropy-based and poker test-based steganalysis methods have been presented. However, the detection performance of these two methods is not so good for the cases of having small quantity of inactive frames or low embedding rates. Thus, we present a new steganalysis method based on statistic characteristics of fundamental frequency. Specifically, we employ the statistics for zero-crossing count (ZCC), including the average ZCC of inactive frames, the ratio between the average ZCC of inactive frames and that of all frames, and the difference between the average ZCC of inactive frames and their calibrated versions, to characterize the frame-level dynamic characteristic of speech signals; we utilize the average values of Mel-frequency cepstral coefficients (MFCCs) to represent the invariant characteristic of inactive frames; further, using the feature set consisting of the zero-crossing statistics and average MFCCs, we propose a support-vector-machine based steganalysis for inactive speech frames. The proposed steganalysis method is evaluated with a large number of ITU-T G.723.1 encoded speech samples, and compared with the existing methods. The experimental results demonstrate that the proposed method significantly outperforms the previous ones on detection accuracy, false positive rate and false negative rate for any given embedding rates or using the same number of inactive frames. Particularly, the proposed method can provide accurate detecting results for the existing steganographic methods only using very small quantity of inactive frames, and thereby be employed to detecting potential inactive-frame steganography behaviors in real-time speech streams.

STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams

Research on F5 Quantitative Steganalysis Based on Multi-Features and SVR

A New Steganalysis Approach Based on Both Complexity Estimate and Statistical Filter

Research on Holism-Based Feature Extraction and Fusion for Steganalysis

Detection of Heterogeneous Parallel Steganography for Low Bit-Rate VoIP Speech Streams.

Detection of audio-to-image audio steganography based on peak frequency feature

Real-time Steganalysis for Streaming Media Based on Multi-Channel Convolutional Sliding Windows

Real-Time Steganalysis for Stream Media Based on Multi-channel Convolutional Sliding Windows

Efficient Streaming Voice Steganalysis in Challenging Detection Scenarios

A Blind Audio Steganalysis Based on Feature Fusion

Fast Detection of Heterogeneous Parallel Steganography for Streaming Voice

RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network

Steganalysis of Adaptive Multi-Rate Speech Using Statistical Characteristics of Pitch Delay

Fast Steganalysis Method for VoIP Streams

Detecting Steganography in Inactive Voice-Over-IP Frames Based on Statistic Characteristics of Fundamental Frequency

Distributed Steganalysis of Compressed Speech.

Steganalysis of Adaptive Multi-Rate Speech Using Statistical Characteristics of Pulse Pairs

Steganalysis of VoIP Streams with CNN-LSTM Network.

A Novel Steganographic Method for Algebraic-Code-excited-linear-prediction Speech Streams Based on Fractional Pitch Delay Search

Steganalysis of Analysis-By-Synthesis Speech Exploiting Pulse-Position Distribution Characteristics

Steganalysis of Low Bit-Rate Speech Based on Statistic Characteristics of Pulse Positions