Abstract:Infrared and visible image fusion aims to reconstruct fused images with comprehensive visual information by merging the complementary features of source images captured by different imaging sensors. This technology has been widely used in civil and military fields, such as urban security monitoring, remote sensing measurement, and battlefield reconnaissance. However, the existing methods still suffer from the preset fusion strategies that cannot be adjustable to different fusion demands and the loss of information during the feature propagation process, thereby leading to the poor generalization ability and limited fusion performance. Therefore, we propose an unsupervised end-to-end network with learnable fusion strategy for infrared and visible image fusion in this paper. The presented network mainly consists of three parts, including the feature extraction module, the fusion strategy module, and the image reconstruction module. First, in order to preserve more information during the process of feature propagation, dense connections and residual connections are applied to the feature extraction module and the image reconstruction module, respectively. Second, a new convolutional neural network is designed to adaptively learn the fusion strategy, which is able to enhance the generalization ability of our algorithm. Third, due to the lack of ground truth in fusion tasks, a loss function that consists of saliency loss and detail loss is exploited to guide the training direction and balance the retention of different types of information. Finally, the experimental results verify that the proposed algorithm delivers competitive performance when compared with several state-of-the-art algorithms in terms of both subjective and objective evaluations. Our codes are available at https://github.com/MinjieWan/Unsupervised-end-to-end-infrared-and-visible-image-fusion-network-using-learnable-fusion-strategy.

CTFusion: CNN-transformer-based self-supervised learning for infrared and visible image fusion

Fusion of Low-Illuminance Visible and Near-Infrared Images Based on Convolutional Neural Networks

Infrared and Visible Image Fusion Based on a Two-Stage Class Conditioned Auto-Encoder Network.

MSFNet: MultiStage Fusion Network for infrared and visible image fusion

A Deep Learning Framework for Infrared and Visible Image Fusion Without Strict Registration

MFST: Multi-Modal Feature Self-Adaptive Transformer for Infrared and Visible Image Fusion

CGTF: Convolution-Guided Transformer for Infrared and Visible Image Fusion

HDCTfusion: Hybrid Dual-Branch Network Based on CNN and Transformer for Infrared and Visible Image Fusion

HDCCT: Hybrid Densely Connected CNN and Transformer for Infrared and Visible Image Fusion

Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling

Infrared and Visible Image Fusion with Convolutional Neural Networks.

TCCFusion: An Infrared and Visible Image Fusion Method based on Transformer and Cross Correlation

Integrating Parallel Attention Mechanisms and Multi-Scale Features for Infrared and Visible Image Fusion

Unsupervised end-to-end infrared and visible image fusion network using learnable fusion strategy

LRFE-CL: A self-supervised fusion network for infrared and visible image via low redundancy feature extraction and contrastive learning

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network

HATF: Multi-Modal Feature Learning for Infrared and Visible Image Fusion via Hybrid Attention Transformer

S2CANet: A self-supervised infrared and visible image fusion based on co-attention network

Fusion of Infrared and Visible Images based on Spatial-Channel Attentional Mechanism

THFuse: An Infrared and Visible Image Fusion Network using Transformer and Hybrid Feature Extractor

Infrared-visible Image Fusion Using Accelerated Convergent Convolutional Dictionary Learning