Abstract:Despite the fact that there is a remarkable achievement on multifocus image fusion, most of the existing methods only generate a low-resolution image if the given source images suffer from low resolution. Obviously, a naive strategy is to independently conduct image fusion and image super-resolution. However, this two-step approach would inevitably introduce and enlarge artifacts in the final result if the result from the first step meets artifacts. To address this problem, in this article, we propose a novel method to simultaneously achieve image fusion and super-resolution in one framework, avoiding step-by-step processing of fusion and super-resolution. Since a small receptive field can discriminate the focusing characteristics of pixels in detailed regions, while a large receptive field is more robust to pixels in smooth regions, a subnetwork is first proposed to compute the affinity of features under different types of receptive fields, efficiently increasing the discriminability of focused pixels. Simultaneously, in order to prevent from distortion, a gradient embedding-based super-resolution subnetwork is also proposed, in which the features from the shallow layer, the deep layer, and the gradient map are jointly taken into account, allowing us to get an upsampled image with high resolution. Compared with the existing methods, which implemented fusion and super-resolution independently, our proposed method directly achieves these two tasks in a parallel way, avoiding artifacts caused by the inferior output of image fusion or super-resolution. Experiments conducted on the real-world dataset substantiate the superiority of our proposed method compared with state of the arts.

SwinMFF: toward high-fidelity end-to-end multi-focus image fusion via swin transformer-based network

StackMFF: End-to-end Multi-Focus Image Stack Fusion Network

SwinFuse: A Residual Swin Transformer Fusion Network for Infrared and Visible Images

FCSwinU: Fourier Convolutions and Swin Transformer UNet for Hyperspectral and Multispectral Image Fusion

New Insights into Multi-focus Image Fusion: A Fusion Method Based on Multi-dictionary Linear Sparse Representation and Region Fusion Model

Focus Affinity Perception and Super-Resolution Embedding for Multifocus Image Fusion

Hyperspectral and multispectral remote sensing image fusion using SwinGAN with joint adaptive spatial-spectral gradient loss function

Multi-Focus Image Fusion Using U-Shaped Networks with a Hybrid Objective

Mutli-focus image fusion based on guided filter and image matting network

MSTRIQ: No Reference Image Quality Assessment Based on Swin Transformer with Multi-Stage Fusion

Multi-Focus Image Fusion Based on Multi-Scale Gradients and Image Matting

A Self-Supervised Residual Feature Learning Model for Multifocus Image Fusion

A multi-focus color image fusion algorithm based on low vision image reconstruction and focused feature extraction

SwinFG: A fine-grained recognition scheme based on swin transformer

Exploit the Best of Both End-to-End and Map-Based Methods for Multi-Focus Image Fusion

ZMFF: Zero-shot multi-focus image fusion

Multiscale Feature Interactive Network for Multifocus Image Fusion

Multi-focused image fusion algorithm based on multi-scale hybrid attention residual network

Multi-focus Image Fusion Using Fully Convolutional Two-stream Network for Visual Sensors.

Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion

Learning to Fuse Multi-Focus Image via Convolutional Network Modeling