IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding

Pengcheng Li,Xulong Zhang,Jing Xiao,Jianzong Wang
2024-09-29
Abstract:The audio watermarking technique embeds messages into audio and accurately extracts messages from the watermarked audio. Traditional methods develop algorithms based on expert experience to embed watermarks into the time-domain or transform-domain of signals. With the development of deep neural networks, deep learning-based neural audio watermarking has emerged. Compared to traditional algorithms, neural audio watermarking achieves better robustness by considering various attacks during training. However, current neural watermarking methods suffer from low capacity and unsatisfactory imperceptibility. Additionally, the issue of watermark locating, which is extremely important and even more pronounced in neural audio watermarking, has not been adequately studied. In this paper, we design a dual-embedding watermarking model for efficient locating. We also consider the impact of the attack layer on the invertible neural network in robustness training, improving the model to enhance both its reasonableness and stability. Experiments show that the proposed model, IDEAW, can withstand various attacks with higher capacity and more efficient locating ability compared to existing methods.
Multimedia,Cryptography and Security,Sound,Audio and Speech Processing
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address several key issues in neural audio watermarking technology: 1. **Low Capacity**: Current neural audio watermarking methods generally suffer from insufficient capacity. 2. **Poor Imperceptibility**: Audio watermarks are easily detectable by the human auditory system. 3. **Low Localization Efficiency**: Existing methods require traversing the entire audio file to extract the watermark, which is time-consuming. 4. **Insufficient Robustness**: Watermarks are easily damaged when faced with various attacks (such as filtering, compression, etc.). To tackle these problems, the authors propose a new model called IDEAW (Invertible Dual-Embedding Audio Watermarking), which improves existing neural audio watermarking technology through the following innovations: - Using a two-stage Invertible Neural Network (INN) to embed localization codes and watermark information separately, thereby improving localization efficiency. - Introducing a balancing module to mitigate asymmetry caused by attack layers, enhancing the model's robustness. - Designing a new training strategy to ensure the imperceptibility and robustness of the watermark.