TMFIF:Transformer-based Multi-Focus Image Fusion

Rui Li,Shengling Geng,Dan Zhang,Mingquan Zhou
DOI: https://doi.org/10.1109/cvidl62147.2024.10603893
2024-01-01
Abstract:Multi-focus image fusion is a hot topic in the field of image processing, and it is a fundamental problem in the fields of image editing, image synthesis, and target retrieval. In previous fusion methods, although feature-rich datasets, models, and algorithms have been provided, there are still many problems with the effective fusion of distant and near views in complex backgrounds. To solve the challenging multi-focus image fusion problem more accurately, we introduce the Transformer Network based on Encoder Decoder (TMFIF), which can extract more generalized features based on the features of multi-focused images to achieve image fusion in complex backgrounds for better visual effects. In this work, we achieve fusion by inputting two images, near and far view. We compare the performance with other multifocal image fusion algorithms by conducting experiments on publicly available datasets and illustrated using existing evaluation methods and evaluation metrics; the results of the experiments show that our method visualizes the fusion effect better through the encoder and the decoder and the evaluation metrics are also relatively good.
What problem does this paper attempt to address?