Abstract:Infrared and visible image fusion is an essential task for multi-sensor image fusion. Generative adversarial networks (GAN) have achieved remarkable performance in the fusion of infrared and visible image. Existing GAN based fusion methods merely using infrared and visible image as input for the fusion, while we found that differential images obtained by subtraction between two image sources could provide contrast information for the fusion. To this end, a novel dual fusion path generative adversarial network (DFPGAN) is proposed in this paper for infrared and visible image fusion. We divided the generator of generative adversarial network into two fusion paths namely infrared-visible path and differential path. The input of infrared-visible path concatenated two image sources to make infrared intensity and texture details keep balance fusion in this path. The input of differential path concatenated differential images obtained by subtraction between two image sources to make contrast information fusion in this path. The features extracted by two fusion paths are concatenated at the end of the generator to generate fused images with contrast effect and balanced information distribution. Meanwhile, we have implemented dual self-attention feature refine module (DSAM) on two fusion paths to refine feature maps in two fusion paths. We adopted switchable normalization layer (SN) substitute for batch normalization layer (BN) in the generator and discriminator to avoid fusion artifact. Furthermore, a mixed content loss is integrated in the generator loss functions to guide the generated image keep balanced information distribution and preserving contrast simultaneously. The adversarial training employed dual adversarial architecture to balance the distribution of infrared intensity and texture details. To verifying the improvement effect of fusion image on target detection, we introduce the Scaled-YOLOv4 target detection framework as evaluation framework, and use the proposed network to fuse RGB images and infrared images for target detection. The results of qualitative and quantitative experiments conducted on public datasets demonstrated the superiority of proposed network over other state-of-the-art methods and could generate fused images with distinctly contrast.

Infrared and visible image fusion based on double fluid pyramids and multi-scale gradient residual block

Infrared and Visible Image Fusion Based on a Two-Stage Class Conditioned Auto-Encoder Network.

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals

FDNet: An end-to-end fusion decomposition network for infrared and visible images

Infrared and Visible Image Fusion Based on Filtering Enhancement

Infrared and visible image fusion with entropy-based adaptive fusion module and mask-guided convolutional neural network

A Dual-branch Network for Infrared and Visible Image Fusion

MGFuse: An Infrared and Visible Image Fusion Algorithm Based on Multiscale Decomposition Optimization and Gradient-Weighted Local Energy

DSG-Fusion: Infrared and visible image fusion via generative adversarial networks and guided filter

SFPFusion: An Improved Vision Transformer Combining Super Feature Attention and Wavelet-Guided Pooling for Infrared and Visible Images Fusion

An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion

HDCTfusion: Hybrid Dual-Branch Network Based on CNN and Transformer for Infrared and Visible Image Fusion

SADFusion: A multi-scale infrared and visible image fusion method based on salient-aware and domain-specific

GRPAFusion: A Gradient Residual and Pyramid Attention-Based Multiscale Network for Multimodal Image Fusion

When Image Decomposition Meets Deep Learning: A Novel Infrared and Visible Image Fusion Method

Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network

DFPGAN: Dual Fusion Path Generative Adversarial Network for Infrared and Visible Image Fusion

Infrared and visible image fusion based on VPDE model and VGG network

CHFusion: A Cross-modality High-resolution Representation Framework for Infrared and Visible Image Fusion

Multi-scale infrared and visible image fusion framework based on dual partial differential equations

An infrared and visible image fusion network based on multi‐scale feature cascades and non‐local attention