Abstract:<p>Benefitting from the strong feature extraction capability of deep learning, infrared and visible image fusion has made a great progress. Since infrared and visible images are obtained by different sensors with different imaging mechanisms, there exists domain discrepancy, which becomes stumbling block for effective fusion. In this paper, we propose a novel self-supervised feature adaption framework for infrared and visible image fusion. We implement a self-supervised strategy that facilitates the backbone network to extract features with adaption while retaining the vital information by reconstructing the source images. Specifically, we preliminary adopt an encoder network to extract features with adaption. Then, two decoders with attention mechanism blocks are utilized to reconstruct the source images in a self-supervised way, forcing the adapted features to contain vital information of the source images. Further, considering the case that source images contain low-quality information, we design a novel infrared and visible image fusion and enhancement model, improving the fusion method's robustness. Experiments are constructed to evaluate the proposed method qualitatively and quantitatively, which show that the proposed method achieves the state-of-art performance comparing with existing infrared and visible image fusion methods.</p>

What problem does this paper attempt to address?

This paper mainly discusses a core issue in the fusion of infrared and visible light images, that is, due to different imaging mechanisms and domain discrepancies caused by different sensors used for capturing the two types of images, effective fusion is hindered. To solve this problem, the paper proposes a novel self-supervised feature adaptation framework for the fusion of infrared and visible light images. The author first introduces traditional image fusion methods, such as multi-scale transformation, sparse representation, and subspace methods, and points out that these methods have limited performance in dealing with domain differences when fusing infrared and visible light images. In recent years, with the powerful feature extraction capability of deep learning, many studies have been applied to the fusion of infrared and visible light images. However, existing methods often use the same convolutional operations to adapt to domain differences, which can easily lead to the loss of important details. In the paper, the author proposes a self-supervised strategy, which uses an encoding network to extract adaptive features and then uses two decoders with attention mechanisms to reconstruct the source images in a self-supervised manner, forcing the adaptive features to contain important information from the source images. In addition, considering that the source images may contain low-quality information, they also design a new fusion and enhancement model for infrared and visible light images to improve the robustness of the method. Experimental results show that the proposed method outperforms existing infrared and visible light image fusion methods in both qualitative and quantitative evaluations. Through this self-supervised feature adaptation method, domain differences can be better handled, key information can be preserved, and the quality and visual perception of the fusion images can be improved.

Self-supervised feature adaption for infrared and visible image fusion

Interactive Feature Embedding for Infrared and Visible Image Fusion

Unsupervised end-to-end infrared and visible image fusion network using learnable fusion strategy

Infrared and Visible Image Fusion Based on a Two-Stage Class Conditioned Auto-Encoder Network.

MFST: Multi-Modal Feature Self-Adaptive Transformer for Infrared and Visible Image Fusion

Advancing infrared and visible image fusion with an enhanced multiscale encoder and attention-based networks

Infrared and Visible Image Fusion Based on Filtering Enhancement

An Infrared and Visible Image Fusion Method Based on Adaptive Weight Learning

Adaptive low light visual enhancement and high-significant target detection for infrared and visible image fusion

Interactive residual coordinate attention and contrastive learning for infrared and visible image fusion in triple frequency bands

Infrared and visible image fusion with entropy-based adaptive fusion module and mask-guided convolutional neural network

EV-Fusion: A Novel Infrared and Low-Light Color Visible Image Fusion Network Integrating Unsupervised Visible Image Enhancement

Infrared and Visible Image Fusion Method Based on Hierarchical Attention Mechanism

Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals

DCFusion: A Dual-Frequency Cross-Enhanced Fusion Network for Infrared and Visible Image Fusion.

A Deep Learning Framework for Infrared and Visible Image Fusion Without Strict Registration

Infrared and Visible Image Fusion Based on Adversarial Feature Extraction and Stable Image Reconstruction

FSADFuse: A Novel Fusion Approach to Infrared and Visible Images

A Multi-Stage Visible and Infrared Image Fusion Network Based on Attention Mechanism

SADFusion: A multi-scale infrared and visible image fusion method based on salient-aware and domain-specific