A Multi-Focus Image Fusion Network Combining Dilated Convolution with Learnable Spacings and Residual Dense Network

Jing Fang,Xinglin Ning,Taiyong Mao,Mengting Zhang,Yuefeng Zhao,Shaohai Hu,Jingjing Wang
DOI: https://doi.org/10.1016/j.compeleceng.2024.109299
IF: 4.152
2024-01-01
Computers & Electrical Engineering
Abstract:Multi-focus image fusion is an enhancement method that aims to generate a fully focused image. However, most multi-focus image fusion algorithms do not consider the defocus spread effect (DSE) on the fused image. Moreover, there are noticeable artifacts in the fusion results at the boundaries between the focused and defocused regions, which need further improvement. To solve the above problems, this study proposes a network architecture called Dilated Residual Dense Network (DRDN), which combines the advantages of residual dense networks and dilated convolutions. Specifically dilated convolutions have a wide receptive field and can extract multi-scale features from the source image, while residual dense networks can extract deep local features in the image. By leveraging the complementary advantages of these two network structures, DRDN can extract comprehensive features of the image. Among them, dilated convolution with learnable spacings is used for dilated convolution to enhance the classification performance and robustness of DRDN. And DRDN can be easily extended to various feature extraction tasks. For severe DSE, a solution is proposed, which greatly improves severe DSE through a high-frequency enhancement and sharpening algorithm. The experiments demonstrate the superiority of our proposed architecture and achieve competitive results with state-of-the-art methods.
What problem does this paper attempt to address?