O-Mamba: O-shape State-Space Model for Underwater Image Enhancement

Chenyu Dong,Chen Zhao,Weiling Cai,Bo Yang
2024-08-23
Abstract:Underwater image enhancement (UIE) face significant challenges due to complex underwater lighting conditions. Recently, mamba-based methods have achieved promising results in image enhancement tasks. However, these methods commonly rely on Vmamba, which focuses only on spatial information modeling and struggles to deal with the cross-color channel dependency problem in underwater images caused by the differential attenuation of light wavelengths, limiting the effective use of deep networks. In this paper, we propose a novel UIE framework called O-mamba. O-mamba employs an O-shaped dual-branch network to separately model spatial and cross-channel information, utilizing the efficient global receptive field of state-space models optimized for underwater images. To enhance information interaction between the two branches and effectively utilize multi-scale information, we design a Multi-scale Bi-mutual Promotion Module. This branch includes MS-MoE for fusing multi-scale information within branches, Mutual Promotion module for interaction between spatial and channel information across branches, and Cyclic Multi-scale optimization strategy to maximize the use of multi-scale information. Extensive experiments demonstrate that our method achieves state-of-the-art (SOTA) results.The code is available at <a class="link-external link-https" href="https://github.com/chenydong/O-Mamba" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the challenging issues in Underwater Image Enhancement (UIE). Specifically, the paper proposes new solutions for image enhancement tasks under complex underwater lighting conditions. #### Main Issues 1. **Cross-Color Channel Dependency Issue**: Existing methods based on Vmamba focus only on spatial information modeling, making it difficult to handle the cross-color channel dependency issue in underwater images caused by the differential attenuation of light at different wavelengths. 2. **Physical Model Limitations**: Traditional UIE methods based on physical characteristics have certain effects, but due to their limited performance and reliance on specific assumptions, they perform poorly in complex underwater environments. 3. **Insufficient Global Information Capture**: Existing CNN methods are limited by their restricted receptive fields, making it difficult to effectively capture global dependencies. Although Transformer methods can model long-range dependencies, their high computational complexity makes them unsuitable for high-resolution images. #### Proposed Solution The paper proposes a new framework called O-mamba, which adopts an O-shaped dual-branch network structure to model spatial and cross-channel information separately. To achieve information interaction between the two branches and fully utilize multi-scale information, a Multi-scale Bi-mutual Promotion Module is designed. With these improvements, O-mamba can achieve better performance in underwater image enhancement tasks.