Abstract:Infrared and visible image fusion aims to reconstruct fused images with comprehensive visual information by merging the complementary features of source images captured by different imaging sensors. This technology has been widely used in civil and military fields, such as urban security monitoring, remote sensing measurement, and battlefield reconnaissance. However, the existing methods still suffer from the preset fusion strategies that cannot be adjustable to different fusion demands and the loss of information during the feature propagation process, thereby leading to the poor generalization ability and limited fusion performance. Therefore, we propose an unsupervised end-to-end network with learnable fusion strategy for infrared and visible image fusion in this paper. The presented network mainly consists of three parts, including the feature extraction module, the fusion strategy module, and the image reconstruction module. First, in order to preserve more information during the process of feature propagation, dense connections and residual connections are applied to the feature extraction module and the image reconstruction module, respectively. Second, a new convolutional neural network is designed to adaptively learn the fusion strategy, which is able to enhance the generalization ability of our algorithm. Third, due to the lack of ground truth in fusion tasks, a loss function that consists of saliency loss and detail loss is exploited to guide the training direction and balance the retention of different types of information. Finally, the experimental results verify that the proposed algorithm delivers competitive performance when compared with several state-of-the-art algorithms in terms of both subjective and objective evaluations. Our codes are available at https://github.com/MinjieWan/Unsupervised-end-to-end-infrared-and-visible-image-fusion-network-using-learnable-fusion-strategy.

Interactive residual coordinate attention and contrastive learning for infrared and visible image fusion in triple frequency bands

Infrared and visible image fusion method based on saliency detection and target-enhancement

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals

Infrared and Visible Image Fusion Based on a Two-Stage Class Conditioned Auto-Encoder Network.

Fusion of Infrared and Visible Images Via Multi-Layer Convolutional Sparse Representation

Advancing infrared and visible image fusion with an enhanced multiscale encoder and attention-based networks

Infrared and visible image fusion with entropy-based adaptive fusion module and mask-guided convolutional neural network

Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning

When Image Decomposition Meets Deep Learning: A Novel Infrared and Visible Image Fusion Method

Infrared-visible Image Fusion Based on Regional Attention Auto-Encoder

MEEAFusion: Multi-Scale Edge Enhancement and Joint Attention Mechanism Based Infrared and Visible Image Fusion

An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion

Adaptive low light visual enhancement and high-significant target detection for infrared and visible image fusion

A joint convolution auto-encoder network for infrared and visible image fusion

Infrared-visible Image Fusion Using Accelerated Convergent Convolutional Dictionary Learning

An efficient frequency domain fusion network of infrared and visible images

A Multi-scale Information Integration Framework for Infrared and Visible Image Fusion

Fusion of Infrared and Visible Images based on Spatial-Channel Attentional Mechanism

Infrared and Visible Image Fusion Based on Filtering Enhancement

Unsupervised end-to-end infrared and visible image fusion network using learnable fusion strategy

An infrared and visible image fusion network based on multi‐scale feature cascades and non‐local attention