HP-CRL: High-resolution preservation driven collaborative representation learning for infrared and visible image fusion

Jingyu Huang,Rencan Nie,Jinde Cao,Ying Zhang,Huaping Su
DOI: https://doi.org/10.1016/j.optlastec.2024.111184
IF: 4.939
2024-05-24
Optics & Laser Technology
Abstract:The Infrared and Visible Image Fusion (IVIF) task aims to generate a fused image that capitalizes on the salient features of the infrared image and the textural details of the visible image. The disparity between infrared and visible modalities has long been recognized as a significant impediment within the realm of IVIF. To address the challenge of integrating modality-specific and modality-shared features in cross-modality feature fusion, we propose a novel high-resolution preservation driven collaborative representation learning method to effectively fuse infrared and visible images, known as HP-CRL. In our model, a high-resolution preservation module is proposed to progressively extract multi-scale feature representations. More specifically, to alleviate the loss of features caused by down-sampling, we draw inspiration from the back-projection technique to continuously complement multi-resolution features while simultaneously maintaining high-resolution representations. Also, a module combining both Vision Transformer (Vit) and convolutional attention is employed to enhance the semantic representation of source images. Aiming at the problem of information redundancy during feature extraction, we employ a multi-branch transmission module for collaborative representation learning across branches and full interaction between multi-scale features. Our experiments demonstrate the efficacy of HP-CRL, surpassing other 15 state-of-the-art (SOTA) fusion methods. The results suggest the promise of our approach in achieving superior fusion quality and maintaining the salient characteristics of the source images.
optics,physics, applied
What problem does this paper attempt to address?