WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning

Dongjian Huo,Zehong Zhang,Hanjing Su,Guanbin Li,Chaowei Fang,Qingyao Wu
2023-08-22
Abstract:Watermarking serves as a widely adopted approach to safeguard media copyright. In parallel, the research focus has extended to watermark removal techniques, offering an adversarial means to enhance watermark robustness and foster advancements in the watermarking field. Existing watermark removal methods mainly rely on UNet with task-specific decoder branches--one for watermark localization and the other for background image restoration. However, watermark localization and background restoration are not isolated tasks; precise watermark localization inherently implies regions necessitating restoration, and the background restoration process contributes to more accurate watermark localization. To holistically integrate information from both branches, we introduce an implicit joint learning paradigm. This empowers the network to autonomously navigate the flow of information between implicit branches through a gate mechanism. Furthermore, we employ cross-channel attention to facilitate local detail restoration and holistic structural comprehension, while harnessing nested structures to integrate multi-scale information. Extensive experiments are conducted on various challenging benchmarks to validate the effectiveness of our proposed method. The results demonstrate our approach's remarkable superiority, surpassing existing state-of-the-art methods by a large margin.
Multimedia,Computation and Language,Computer Vision and Pattern Recognition,Image and Video Processing
What problem does this paper attempt to address?
This paper aims to address the problem of digital watermark removal and proposes a new method called WMFormer++. Specifically, the paper focuses on the following points: 1. **Limitations of Existing Methods**: Existing watermark removal methods mainly rely on the UNet architecture and use task-specific decoder branches to achieve watermark localization and background image restoration separately. However, these two tasks are not isolated; accurate watermark localization itself implies the areas that need to be restored, and the background restoration process also helps in more accurately locating the watermark. 2. **Joint Learning Paradigm**: To integrate information from the two branches, the authors introduce an implicit joint learning paradigm that allows the network to autonomously navigate the flow of information between implicit branches through a gating mechanism. Additionally, a cross-channel attention mechanism is utilized to promote local detail recovery and overall structural understanding, and multi-scale information is fused through a nested structure. 3. **Experimental Validation**: Extensive experiments on multiple challenging benchmark datasets validate the effectiveness of the proposed method. The results show that this method significantly outperforms existing state-of-the-art methods in terms of performance. In summary, the main contribution of this paper is the proposal of a Transformer-based network framework that uses a single decoder branch to handle both watermark localization and background restoration tasks simultaneously, thereby overcoming the limitations of traditional multi-decoder methods and achieving outstanding results.