Xiaojiao Guo, Yihang Dong, Xuhang Chen, Weiwen Chen, Zimeng Li, FuChen Zheng, Chi-Man Pun
Abstract: Underwater Image Restoration (UIR) remains a challenging task in computer vision due to the complex degradation of images in underwater environments. While recent approaches have leveraged various deep learning techniques, including Transformers and complex, parameter-heavy models to achieve significant improvements in restoration effects, we demonstrate that pure CNN architectures with lightweight parameters can achieve comparable results. In this paper, we introduce UIR-PolyKernel, a novel method for underwater image restoration that leverages Polymorphic Large Kernel CNNs. Our approach uniquely combines large kernel convolutions of diverse sizes and shapes to effectively capture long-range dependencies within underwater imagery. Additionally, we introduce a Hybrid Domain Attention module that integrates frequency and spatial domain attention mechanisms to enhance feature importance. By leveraging the frequency domain, we can capture hidden features that may not be perceptible to humans but are crucial for identifying patterns in both underwater and on-air images. This approach enhances the generalization and robustness of our UIR model. Extensive experiments on benchmark datasets demonstrate that UIR-PolyKernel achieves state-of-the-art performance in underwater image restoration tasks, both quantitatively and qualitatively. Our results show that well-designed pure CNN architectures can effectively compete with more complex models, offering a balance between performance and computational efficiency. This work provides new insights into the potential of CNN-based approaches for challenging image restoration tasks in underwater environments. The code is available at https://github.com/CXH-Research/UIR-PolyKernel.
What problem does this paper attempt to address?
This paper addresses the complex image degradation that makes underwater image restoration (UIR) difficult. Images captured underwater are usually severely degraded: visibility is reduced, colors are distorted, and details are blurred, all of which limit the effectiveness of underwater imaging systems. Existing approaches, both traditional UIR methods (e.g., UNTV, MLLE, ROP) and deep-learning-based methods (e.g., Ucolor, CLUIE-Net, SFGNet), mitigate these problems to some extent, but they often generalize poorly across different types of water bodies and complex underwater scenes. In addition, many high-performance methods rely on complex architectures with high computational cost, which limits their practical use in resource-constrained environments.
To address these problems, the paper proposes a new method named UIR-PolyKernel. It uses Polymorphic Large Kernel CNNs, which combine large-kernel convolutions of different sizes and shapes, to capture long-range dependencies in underwater images while keeping the parameter count low. The paper also introduces a Hybrid Domain Attention (HDA) module, which combines frequency-domain and spatial-domain attention to emphasize important features and to capture hidden features that are hard to see in the spatial domain alone. According to the authors, this improves the generalization and robustness of the model and yields state-of-the-art performance on multiple benchmark datasets, both quantitatively and qualitatively.
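As a rough illustration of the hybrid-domain idea described above, the following NumPy sketch gates a feature map once in the frequency domain and once in the spatial domain, then sums both results with the input. It is not the paper's HDA module: the function names, the `freq_weight` parameter, the sigmoid gating, the mean-plus-max pixel statistics, and the residual sum are all assumptions chosen for illustration.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, used here as a soft gate in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def frequency_attention(feat, freq_weight):
    """Reweight each frequency bin of a (C, H, W) feature map.

    freq_weight is a hypothetical learnable tensor of the same shape as feat;
    in a trained network it would be a parameter, here it is just an input.
    """
    spectrum = np.fft.fft2(feat, axes=(-2, -1))   # per-channel 2D FFT
    gate = sigmoid(freq_weight)                   # one gate per frequency bin
    gated = spectrum * gate
    # The gate need not be conjugate-symmetric, so the inverse transform can
    # be complex; keeping the real part is a simplification of this sketch.
    return np.fft.ifft2(gated, axes=(-2, -1)).real


def spatial_attention(feat):
    """Gate each pixel using channel-wise mean and max statistics (assumed)."""
    stats = feat.mean(axis=0) + feat.max(axis=0)  # shape (H, W)
    return feat * sigmoid(stats)[None, :, :]


def hybrid_domain_attention(feat, freq_weight):
    """Combine both branches with a residual connection (assumed design)."""
    return feat + frequency_attention(feat, freq_weight) + spatial_attention(feat)
```

The frequency branch can suppress or keep whole bands of the spectrum, which affects every pixel at once, while the spatial branch acts on each pixel locally; combining the two is the general intuition behind hybrid-domain attention.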
The following are the key contributions of the paper:
1. **Proposing UIR-PolyKernel**: A lightweight, computationally efficient pure CNN architecture that achieves state-of-the-art performance in underwater image restoration by integrating large-kernel convolutions of multiple sizes and shapes.
2. **Introducing the Hybrid Domain Attention (HDA) module**: It combines frequency-domain and spatial-domain attention to emphasize important features and to capture hidden features that may be imperceptible in the spatial domain, which improves the restoration of fine-grained details.
3. **Extensive experimental verification**: Experiments on multiple benchmark datasets show that UIR-PolyKernel outperforms existing methods, including more complex models, which challenges the view that complex, heavyweight models are necessary for this task.
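To make the first contribution more concrete, here is a minimal NumPy sketch of a block that applies several kernels of different shapes in parallel and sums their responses. It only illustrates the general idea of mixing kernel sizes and shapes; the kernel shapes (7x7, 1x11, 11x1), the averaging weights, the residual connection, and the names `depthwise_conv2d` and `poly_kernel_block` are assumptions, not the configuration used in the paper.

```python
import numpy as np


def depthwise_conv2d(feat, kernel):
    """'Same'-size cross-correlation of each channel of feat (C, H, W)
    with a single (kh, kw) kernel. Odd kernel sizes and zero padding assumed."""
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(feat, ((0, 0), (ph, ph), (pw, pw)))
    height, width = feat.shape[1:]
    out = np.zeros_like(feat, dtype=float)
    # Accumulate one shifted copy of the input per kernel tap.
    for i in range(kh):
        for j in range(kw):
            out += kernel[i, j] * padded[:, i:i + height, j:j + width]
    return out


def poly_kernel_block(feat, kernels):
    """Sum the responses of all kernel shapes and add the input back."""
    return feat + sum(depthwise_conv2d(feat, k) for k in kernels)


# Hypothetical kernel set: one square kernel plus two thin strips.
# Averaging weights stand in for learned weights.
kernels = [
    np.full((7, 7), 1 / 49),    # square, local context
    np.full((1, 11), 1 / 11),   # horizontal strip, wide receptive field
    np.full((11, 1), 1 / 11),   # vertical strip, tall receptive field
]
```

With these assumed shapes, the three kernels hold 49 + 11 + 11 = 71 weights per channel, compared with 121 for a single dense 11x11 kernel, while the strips still reach 11 pixels horizontally and vertically. This is one way a design can widen the receptive field without a matching growth in parameters.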
In summary, the paper presents a well-designed pure CNN architecture that balances restoration performance and computational efficiency in underwater image restoration.