Abstract:It is difficult to use supervised machine-learning methods for infrared (IR) and visible (VIS) image fusion (IVF) because of the shortage of ground-truth target fusion images, and image quality and contrast control are rarely considered in existing IVF methods. In this study, we proposed a simple IVF pipeline that converts the IVF problem into a supervised binary classification problem (sharp vs. blur) and uses image enhancement techniques to improve the image quality in three locations in the pipeline. We took a biological vision consistent assumption that the sharp region contains more useful information than the blurred region. A deep binary classifier based on a convolutional neural network (CNN) was designed to compare the sharpness of the infrared region and visible regions. The output score map of the deep classifier was treated as a weight map in the weighted average fusion rule. The proposed deep binary classifier was trained using natural visible images from the MS COCO dataset, rather than images from the IVF domain (called "cross domain training" here). Specifically, our proposed pipeline contains four stages: (1) enhancing the IR and VIS input images by linear transformation and the High-Dynamic-Range Compression (HDRC) method, respectively; (2) inputting the enhanced IR and VIS images to the trained CNN classifier to obtain the weight map; and (3) using a weight map to obtain the weighted average of the enhanced IR and VIS images; and (4) using single scale Retinex (SSR) to enhance the fused image to obtain the final enhanced fusion image. Extensive experimental results on public IVF datasets demonstrate the superior performance of our proposed approach over other state-of-the-art methods in terms of both subjective visual quality and 11 objective metrics. It was demonstrated that the complementary information between the infrared and visible images can be efficiently extracted using our proposed binary classifier, and the fused image quality is significantly improved. The source code is available at https://github.com/eyob12/Deep_infrared_and_visible_image_fusion .

Infrared and Visible Image Fusion Based on Dilated Residual Attention Network

Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion

An Improved Infrared and Visible Image Fusion Using an Adaptive Contrast Enhancement Method and Deep Learning Network with Transfer Learning

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals

IR-MSDNet: Infrared and Visible Image Fusion Based On Infrared Features and Multiscale Dense Network

DDFNet-A: Attention-Based Dual-Branch Feature Decomposition Fusion Network for Infrared and Visible Image Fusion

Infrared and Visible Image Fusion Based on Deep Decomposition Network and Saliency Analysis

DTFusion: Infrared and Visible Image Fusion Based on Dense Residual PConv-ConvNeXt and Texture-Contrast Compensation

Infrared-visible Image Fusion Using Accelerated Convergent Convolutional Dictionary Learning

Advancing infrared and visible image fusion with an enhanced multiscale encoder and attention-based networks

Infrared and Visible Image Fusion Based on Filtering Enhancement

An infrared and visible image fusion method based on deep learning

Infrared and visible image fusion with entropy-based adaptive fusion module and mask-guided convolutional neural network

A deep learning and image enhancement based pipeline for infrared and visible image fusion

FDNet: An end-to-end fusion decomposition network for infrared and visible images

Multi-scale Convolutional Neural Network for Multi-Focus Image Fusion.

Infrared and Visible Image Fusion with Convolutional Neural Networks.

When Image Decomposition Meets Deep Learning: A Novel Infrared and Visible Image Fusion Method

DCFusion: Dual-Headed Fusion Strategy and Contextual Information Awareness for Infrared and Visible Remote Sensing Image

A robust infrared and visible image fusion framework via multi-receptive-field attention and color visual perception

Infrared and Visible Image Fusion Based on Variational Auto-Encoder and Infrared Feature Compensation