Deep Learning-Based Technique for Remote Sensing Image Enhancement Using Multiscale Feature Fusion

Ming Zhao,Rui Yang,Min Hu,Botao Liu
DOI: https://doi.org/10.3390/s24020673
IF: 3.9
2024-01-21
Sensors
Abstract:The present study proposes a novel deep-learning model for remote sensing image enhancement. It maintains image details while enhancing brightness in the feature extraction module. An improved hierarchical model named Global Spatial Attention Network (GSA-Net), based on U-Net for image enhancement, is proposed to improve the model's performance. To circumvent the issue of insufficient sample data, gamma correction is applied to create low-light images, which are then used as training examples. A loss function is constructed using the Structural Similarity (SSIM) and Peak Signal-to-Noise Ratio (PSNR) indices. The GSA-Net network and loss function are utilized to restore images obtained via low-light remote sensing. This proposed method was tested on the Northwestern Polytechnical University Very-High-Resolution 10 (NWPU VHR-10) dataset, and its overall superiority was demonstrated in comparison with other state-of-the-art algorithms using various objective assessment indicators, such as PSNR, SSIM, and Learned Perceptual Image Patch Similarity (LPIPS). Furthermore, in high-level visual tasks such as object detection, this novel method provides better remote sensing images with distinct details and higher contrast than the competing methods.
engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper proposes a deep learning-based remote sensing image enhancement technique aimed at addressing the issues of insufficient brightness and loss of critical details in remote sensing images acquired under adverse environmental conditions (such as low light). Specifically, the study introduces an improved hierarchical model—Global Spatial Attention Network (GSA-Net), which is based on the U-Net architecture and enhances image quality through multi-scale feature fusion. #### Main Contributions: 1. **Lightweight Convolution Operations**: Utilizes Depthwise Separable Convolution, significantly reducing the number of parameters (approximately by 76%). 2. **Global Attention Module**: Introduces a global attention mechanism to mitigate noise response and integrate local information. 3. **Improved Loss Function**: Constructs a new loss function by combining Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM) metrics to avoid model optimization direction bias and gradient diffusion issues. 4. **Performance Evaluation**: Tested on a synthetic low-light image dataset, demonstrating superior performance over other advanced algorithms on multiple objective evaluation metrics (such as PSNR, SSIM, and Learned Perceptual Image Patch Similarity (LPIPS)). Additionally, in advanced vision tasks such as object detection, this method provides clearer and higher contrast remote sensing images.