Abstract:Abstract Infrared and visible image fusion aims to generate synthetic images including salient targets and abundant texture details. However, traditional techniques and recent deep learning-based approaches have faced challenges in preserving prominent structures and fine-grained features. In this study, we propose a lightweight infrared and visible image fusion network utilizing multi-scale attention modules and hybrid dilated convolutional blocks to preserve significant structural features and fine-grained textural details. First, we design a hybrid dilated convolutional block with different dilation rates that enable the extraction of prominent structure features by enlarging the receptive field in the fusion network. Compared with other deep learning methods, our method can obtain more high-level semantic information without piling up a large number of convolutional blocks, effectively improving the ability of feature representation. Second, distinct attention modules are designed to integrate into different layers of the network to fully exploit contextual information of the source images, and we leverage the total loss to guide the fusion process to focus on vital regions and compensate for missing information. Extensive qualitative and quantitative experiments demonstrate the superiority of our proposed method over state-of-the-art methods in both visual effects and evaluation metrics. The experimental results on public datasets show that our method can improve the entropy (EN) by 4.80%, standard deviation (SD) by 3.97%, correlation coefficient (CC) by 1.86%, correlations of differences (SCD) by 9.98%, and multi-scale structural similarity (MS_SSIM) by 5.64%, respectively. In addition, experiments with the VIFB dataset further indicate that our approach outperforms other comparable models.

A Multi-Focus Image Fusion Network Combining Dilated Convolution with Learnable Spacings and Residual Dense Network

StackMFF: End-to-end Multi-Focus Image Stack Fusion Network

Multi-Scale Cross-Attention Fusion Network Based on Image Super-Resolution

Dual-Focal Camera Hdr Imaging Based On Convolutional Neural Network

Multi-focused image fusion algorithm based on multi-scale hybrid attention residual network

Multi-focus image fusion with deep residual learning and focus property detection

Multiscale Feature Interactive Network for Multifocus Image Fusion

DeFusionNET: Defocus Blur Detection via Recurrently Fusing and Refining Discriminative Multi-Scale Deep Features

Focus Affinity Perception and Super-Resolution Embedding for Multifocus Image Fusion

Multi-Focus Image Fusion Using U-Shaped Networks with a Hybrid Objective

MDDCMA: A Distributed Image Fusion Framework Based on Multiscale Dense Dilated Convolution and Coordinate Mean Attention

Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion

A multi-focus color image fusion algorithm based on low vision image reconstruction and focused feature extraction

Exploit the Best of Both End-to-End and Map-Based Methods for Multi-Focus Image Fusion

Multi-focus Image Fusion Using Fully Convolutional Two-stream Network for Visual Sensors.

Structural Similarity Loss for Learning to Fuse Multi-Focus Images

A Self-Supervised Residual Feature Learning Model for Multifocus Image Fusion

Boundary Aware Multi-focus Image Fusion Using Deep Neural Network.

Multi-focus image fusion with a deep convolutional neural network

MFIF-GAN: A New Generative Adversarial Network for Multi-Focus Image Fusion

New Insights into Multi-focus Image Fusion: A Fusion Method Based on Multi-dictionary Linear Sparse Representation and Region Fusion Model