Abstract:The widespread application of audio and video communication technology make the compressed audio data flowing over the Internet, and make it become an important carrier for covert communication. There are many steganographic schemes emerged in the mainstream audio compression data, such as AAC and MP3, followed by many steganalysis schemes. However, these steganalysis schemes are only effective in the specific embedded domain. In this paper, a general steganalysis scheme Spec-ResNet (Deep Residual Network of Spectrogram) is proposed to detect the steganography schemes of different embedding domain for AAC and MP3. The basic idea is that the steganographic modification of different embedding domain will all introduce the change of the decoded audio signal. In this paper, the spectrogram, which is the visual representation of the spectrum of frequencies of audio signal, is adopted as the input of the feature network to extract the universal features introduced by steganography schemes; Deep Neural Network Spec-ResNet is well-designed to represent the steganalysis feature; and the features extracted from different spectrogram windows are combined to fully capture the steganalysis features. The experiment results show that the proposed scheme has good detection accuracy and generality. The proposed scheme has better detection accuracy for three different AAC steganographic schemes and MP3Stego than the state-of-arts steganalysis schemes which are based on traditional hand-crafted or CNN-based feature. To the best of our knowledge, the audio steganalysis scheme based on the spectrogram and deep residual network is first proposed in this paper. The method proposed in this paper can be extended to the audio steganalysis of other codec or audio forensics.

SSLSS: Semi-Supervised Learning-based Steganalysis Scheme for Instant Voice Communication Network

Steganalysis on Internet Images Via Domain Adaptive Classifier

Hybrid Dictionary Learning for JPEG Steganalysis

Steganographic model and method with instant communication speech stream as carrier

Real-time Steganalysis for Streaming Media Based on Multi-Channel Convolutional Sliding Windows

Efficient Streaming Voice Steganalysis in Challenging Detection Scenarios

Real-Time Steganalysis for Stream Media Based on Multi-channel Convolutional Sliding Windows

Steganalysis of VoIP Streams with CNN-LSTM Network.

A Detection Method of Subliminal Channel Based on VoIP Communication.

Detection of Heterogeneous Parallel Steganography for Low Bit-Rate VoIP Speech Streams.

Distributed Steganalysis of Compressed Speech.

Fast Steganalysis Method for VoIP Streams

Hierarchical Representation Network for Steganalysis of QIM Steganography in Low-Bit-Rate Speech Signals

RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network

STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams

A Blind Audio Steganalysis Based on Feature Fusion

Spec-ResNet: A General Audio Steganalysis scheme based on Deep Residual Network of Spectrogram

Steganalysis of Compressed Speech to Detect Covert Voice over Internet Protocol Channels

An Adaptive Steganography Scheme for Voice over IP.

Steganalysis of Analysis-By-Synthesis Speech Exploiting Pulse-Position Distribution Characteristics

Practical Deep Learning Models for QIM-based VoIP Steganalysis