ESAFormer: Multi-resolution Fusion Network for Pansharpening

Xiangzeng Liu,Rutao Li,Ziyao Wang,Ronghan Li,Qi Cheng,Qiguang Miao
DOI: https://doi.org/10.1561/116.00000174
2024-01-01
APSIPA Transactions on Signal and Information Processing
Abstract:The pansharpening task is to fuse low-resolution multispectral (LRMS) images and high-resolution panchromatic (PAN) images to generate high-resolution multispectral images. Most of the existing methods do not preserve spatial and spectral details well, which is due to ignoring the difference in resolution between the two images. To address this issue, we propose a novel fusion network (ESAFormer) that effectively enhances the spatial and spectral information representation. In the proposed model, a hybrid multi-resolution structure of CNN and Transformer is deployed to allow the features of LRMS images and PAN images to fuse progressively. Subsequently, the enhanced spatial attention module is adopted to preserve spatial details and long-range information. Extensive experimental results indicate that the proposed method is superior to existing SOTA methods on World-View2 and IKONOS datasets.
What problem does this paper attempt to address?