Abstract:Aiming to address the issues of missing detailed information, the blurring of significant target information, and poor visual effects in current image fusion algorithms, this paper proposes an infrared and visible-light image fusion algorithm based on discrete wavelet transform and convolutional neural networks. Our backbone network is an autoencoder. A DWT layer is embedded in the encoder to optimize frequency-domain feature extraction and prevent information loss, and a bottleneck residual block and a coordinate attention mechanism are introduced to enhance the ability to capture and characterize the low- and high-frequency feature information; an IDWT layer is embedded in the decoder to achieve the feature reconstruction of the fused frequencies; the fusion strategy adopts the l1−norm fusion strategy to integrate the encoder's output frequency mapping features; a weighted loss containing pixel loss, gradient loss, and structural loss is constructed for optimizing network training. DWT decomposes the image into sub-bands at different scales, including low-frequency sub-bands and high-frequency sub-bands. The low-frequency sub-bands contain the structural information of the image, which corresponds to the important target information, while the high-frequency sub-bands contain the detail information, such as edge and texture information. Through IDWT, the low-frequency sub-bands that contain important target information are synthesized with the high-frequency sub-bands that enhance the details, ensuring that the important target information and texture details are clearly visible in the reconstructed image. The whole process is able to reconstruct the information of different frequency sub-bands back into the image non-destructively, so that the fused image appears natural and harmonious visually. Experimental results on public datasets show that the fusion algorithm performs well according to both subjective and objective evaluation criteria and that the fused image is clearer and contains more scene information, which verifies the effectiveness of the algorithm, and the results of the generalization experiments also show that our network has good generalization ability.

CAFNET: Cross-Attention Fusion Network for Infrared and Low Illumination Visible-Light Image

Fusion of Low-Illuminance Visible and Near-Infrared Images Based on Convolutional Neural Networks

MAFusion: Multiscale Attention Network for Infrared and Visible Image Fusion

Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion

RDCa-Net: Residual Dense Channel Attention Symmetric Network for Infrared and Visible Image Fusion

An infrared and visible image fusion network based on multi‐scale feature cascades and non‐local attention

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals

Fusion of Infrared and Visible Images based on Spatial-Channel Attentional Mechanism

Infrared and Visible Image Fusion Based on a Two-Stage Class Conditioned Auto-Encoder Network.

Multigrained Attention Network for Infrared and Visible Image Fusion

FDNet: An end-to-end fusion decomposition network for infrared and visible images

Infrared and visible image fusion with entropy-based adaptive fusion module and mask-guided convolutional neural network

A Multi-Stage Visible and Infrared Image Fusion Network Based on Attention Mechanism

A Cross-scale Iterative Attentional Adversarial Fusion Network for Infrared and Visible Images

Integrating Parallel Attention Mechanisms and Multi-Scale Features for Infrared and Visible Image Fusion

CMFA_Net: A cross-modal feature aggregation network for infrared-visible image fusion

Infrared and Visible Image Fusion with Convolutional Neural Networks.

LVIF-Net: Learning synchronous visible and infrared image fusion and enhancement under low-light conditions

MSFNet: MultiStage Fusion Network for infrared and visible image fusion

IR-MSDNet: Infrared and Visible Image Fusion Based On Infrared Features and Multiscale Dense Network

DCFNet: Infrared and Visible Image Fusion Network Based on Discrete Wavelet Transform and Convolutional Neural Network