P‐5.5: An Optimized Robust Watermarking Algorithm for Video Based on Spatio‐temporal Feature Fusion

Yisheng Fan,Jiandong Li,Limin Yan
DOI: https://doi.org/10.1002/sdtp.17233
2024-04-01
SID Symposium Digest of Technical Papers
Abstract:In recent years, there has been an increase in video piracy. One potential solution that has been suggested is to add watermarks to videos as a means of encryption. The current robust watermarking algorithms for video based on deep learning have the problems of poor spatio‐temporal feature extraction ability as well as poor temporal consistency and imperceptibility. Thus, this paper proposes an optimized robust watermarking algorithm for video based on spatio‐temporal feature fusion. For the problem of poor spatio‐temporal feature extraction ability of traditional convolutional neural network for carrier video, we improve the 3D‐UNet framework by utilizing the ResNet residual connection and EAM, and design a codec with 3D Res‐EAM UNet structure, which enhances the network's ability of extracting spatio‐temporal features of video. For the problem of poor temporal consistency and imperceptibility of watermarked video, we propose an optimized robust watermarking optimization algorithm for video based on temporal and spatial feature fusion. For the problem of poor temporal consistency and imperceptibility of watermarked video, we propose a multi‐scale discriminator, which helps to extract complete watermark information from the video by generating antagonistic effects with the codec. The experiments prove that this algorithm works well and effectively improves the robustness of the video watermarking algorithm and the ability to resist time synchronization attacks.
What problem does this paper attempt to address?