ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing

Zhihui Zhang,Jinhui Pang,Jianan Li,Xiaoshuai Hao
2024-11-07
Abstract:Multi-Image Super-Resolution (MISR) is a crucial yet challenging research task in the remote sensing community. In this paper, we address the challenging task of Multi-Image Super-Resolution in Remote Sensing (MISR-RS), aiming to generate a High-Resolution (HR) image from multiple Low-Resolution (LR) images obtained by satellites. Recently, the weak temporal correlations among LR images have attracted increasing attention in the MISR-RS task. However, existing MISR methods treat the LR images as sequences with strong temporal correlations, overlooking spatial correlations and imposing temporal dependencies. To address this problem, we propose a novel end-to-end framework named Enhancing Spatial Correlations in MISR (ESC-MISR), which fully exploits the spatial-temporal relations of multiple images for HR image reconstruction. Specifically, we first introduce a novel fusion module named Multi-Image Spatial Transformer (MIST), which emphasizes parts with clearer global spatial features and enhances the spatial correlations between LR images. Besides, we perform a random shuffle strategy for the sequential inputs of LR images to attenuate temporal dependencies and capture weak temporal correlations in the training stage. Compared with the state-of-the-art methods, our ESC-MISR achieves 0.70dB and 0.76dB cPSNR improvements on the two bands of the PROBA-V dataset respectively, demonstrating the superiority of our method.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to solve the challenging problems of multi - image super - resolution reconstruction (MISR) in the field of remote sensing. The specific goal is to generate high - resolution (HR) images from multiple low - resolution (LR) satellite images. The following are the main problems that the paper attempts to solve: 1. **Spatio - temporal correlation problems**: - Existing MISR methods rely too much on strong temporal correlations in time series and ignore spatial correlations. - The paper points out that factors such as the time interval of remote - sensing images, cloud occlusion, and illumination changes lead to weak temporal correlations, while there is complementary spatial information between different images. 2. **Limitations of existing methods**: - Existing methods regard LR images as sequences with strong temporal correlations, ignore spatial correlations, and are sensitive to the order of input images. - For example, although RAMS and PIU attempt to overcome time - dependence, they fail to completely eliminate the influence of frame order; although TR - MISR adapts to weak temporal correlations, it does not emphasize the spatial correlations between images. To solve these problems, the paper proposes a new end - to - end framework - multi - image super - resolution with enhanced spatial correlation (ESC - MISR), which is mainly improved in the following ways: - **Introducing a new fusion module**: Multi - Image Spatial Transformer (MIST), which is used to enhance the spatial correlations between multiple LR images. - **Random shuffling strategy**: Randomly shuffle the order of LR images during the training stage to weaken time - dependence and capture weak temporal correlations. - **Encoder and decoder design**: Adopt CNNs - meet - transformers (CMT) encoder and Fast Fourier Convolution (FFC) decoder to improve feature extraction and decoding effects. These improvements enable ESC - MISR to more effectively utilize spatio - temporal relationships when processing remote - sensing images, thereby generating higher - quality HR images. Experimental results show that the performance of ESC - MISR on the PROBA - V dataset is significantly better than that of existing methods, especially with an increase of 0.70 dB and 0.76 dB in cPSNR values in the NIR and RED bands respectively.