Abstract:Background and objective: Due to the depth of focus (DOF) limitations of the optical systems of microscopes, it is often difficult to achieve full clarity from microscopic biomedical images under high-magnification microscopy. Multifocus microscopic biomedical image fusion (MFBIF) can effectively solve this problem. Considering both information richness and visual authenticity, this paper proposes a transformer network for MFBIF called TransFusion-Net. Methods: TransFusion-Net consists of two modules. One module is an interlayer cross-attention module, which is used to obtain feature mappings under the long-range dependencies observed among multiple nonfocus source images. The other module is a spatial attention upsampling network (SAU-Net) module, which is used to obtain global semantic information after further spatial attention is applied. Thus, TransFusion-Net can simultaneously receive multiple input images from a nonfull-focus microscope and make full use of the strong correlations between the source images to output accurate fusion results in an end-to-end manner. Results: The fusion results were quantitatively and qualitatively compared with those of eight state-of-the-art algorithms. In the quantitative experiments, five evaluation metrics, QAB/F, QMI, QAVG, QCB, and PSNR, were used to evaluate the performance of each method, and the proposed method achieved values of 0.6574, 8.4572, 5.6305, 0.7341, and 89.5685, respectively, which are higher than those of the current state-of-the-art algorithms. In the qualitative experiments, a differential image was used for further validation, and the near-zero residuals visually verified the adequacy of the proposed method for fusion. Furthermore, we showed some fusion results of multifocused biomedical microscopy images to verify the reliability of the proposed method, which shows high-quality fusion results. Conclusion: Multifocus biomedical microscopic image fusion can be accurately and effectively achieved by devising a deep convolutional neural network with joint cross-attention and spatial attention mechanisms.

MSI-DTrans: A Multi-Focus Image Fusion Using Multilayer Semantic Interaction and Dynamic Transformer

StackMFF: End-to-end Multi-Focus Image Stack Fusion Network

Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion

New Insights into Multi-focus Image Fusion: A Fusion Method Based on Multi-dictionary Linear Sparse Representation and Region Fusion Model

Multi-Focus Image Fusion Using U-Shaped Networks with a Hybrid Objective

A Novel Multiscale Transform Decomposition Based Multi-Focus Image Fusion Framework

Multiscale Feature Interactive Network for Multifocus Image Fusion

Focus Affinity Perception and Super-Resolution Embedding for Multifocus Image Fusion

MFST: Multi-Modal Feature Self-Adaptive Transformer for Infrared and Visible Image Fusion

Multi-scale Convolutional Neural Network for Multi-Focus Image Fusion.

Mutli-focus image fusion based on guided filter and image matting network

Multi-focus image fusion based on transformer and depth information learning

FusionDiff: Multi-focus image fusion using denoising diffusion probabilistic models

Multi-focus image fusion with parameter adaptive dual channel dynamic threshold neural P systems

Multi-Focus Image Fusion Based on Multi-Scale Gradients and Image Matting

TransFusion-net for multifocus microscopic biomedical image fusion

Multifocus Image Fusion Based on Discrete Cosine Transform

Multi-focus image fusion with deep residual learning and focus property detection

Multi-focus Image Fusion with Structure-Driven Adaptive Regions

Multi-feature fusion enhanced transformer with multi-layer fused decoding for image captioning

MDDCMA: A Distributed Image Fusion Framework Based on Multiscale Dense Dilated Convolution and Coordinate Mean Attention