Audio Steganography with Speech Recognition System

Hao Tan,Chenwei Liu,Yinyu Lyu,Xiao Zhang,Denghui Zhang,Zhaoquan Gu
DOI: https://doi.org/10.1109/dsc53577.2021.00042
2021-01-01
Abstract:Deep neural networks (DNNs) are vulnerable to adversarial examples that are intentionally crafted by adding small perturbations to the original input. Most works focus on generating such adversarial examples to reveal the security concerns of DNNs, while few of them explore the positive usage of the adversarial examples. In this paper, we generate adversarial audio that could fool DNNs for speech recognition, but can be utilized for audio steganography. Specifically, the generated adversarial audio contains secret information which is only recognized by DeepSpeech (an authorized speech recognition system), while unrecognizable to humans or other unauthorized speech recognition systems. Experimental results show that with an average SNR of −28.75, the adversarial audio can achieve high success ratio of 88%.
What problem does this paper attempt to address?