Abstract:Background and objective: Due to the depth of focus (DOF) limitations of the optical systems of microscopes, it is often difficult to achieve full clarity from microscopic biomedical images under high-magnification microscopy. Multifocus microscopic biomedical image fusion (MFBIF) can effectively solve this problem. Considering both information richness and visual authenticity, this paper proposes a transformer network for MFBIF called TransFusion-Net. Methods: TransFusion-Net consists of two modules. One module is an interlayer cross-attention module, which is used to obtain feature mappings under the long-range dependencies observed among multiple nonfocus source images. The other module is a spatial attention upsampling network (SAU-Net) module, which is used to obtain global semantic information after further spatial attention is applied. Thus, TransFusion-Net can simultaneously receive multiple input images from a nonfull-focus microscope and make full use of the strong correlations between the source images to output accurate fusion results in an end-to-end manner. Results: The fusion results were quantitatively and qualitatively compared with those of eight state-of-the-art algorithms. In the quantitative experiments, five evaluation metrics, QAB/F, QMI, QAVG, QCB, and PSNR, were used to evaluate the performance of each method, and the proposed method achieved values of 0.6574, 8.4572, 5.6305, 0.7341, and 89.5685, respectively, which are higher than those of the current state-of-the-art algorithms. In the qualitative experiments, a differential image was used for further validation, and the near-zero residuals visually verified the adequacy of the proposed method for fusion. Furthermore, we showed some fusion results of multifocused biomedical microscopy images to verify the reliability of the proposed method, which shows high-quality fusion results. Conclusion: Multifocus biomedical microscopic image fusion can be accurately and effectively achieved by devising a deep convolutional neural network with joint cross-attention and spatial attention mechanisms.

MACTFusion: Lightweight Cross Transformer for Adaptive Multimodal Medical Image Fusion

Mmformer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation

Transformer-Based End-to-End Anatomical and Functional Image Fusion

Multi-Modal Image Fusion Via Deep Laplacian Pyramid Hybrid Network

MDC-RHT: Multi-Modal Medical Image Fusion via Multi-Dimensional Dynamic Convolution and Residual Hybrid Transformer

An Improved Hybrid Network With a Transformer Module for Medical Image Fusion

Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion

Multimodal Transformer for Accelerated MR Imaging

RTFusion: A Multimodal Fusion Network with Significant Information Enhancement

EMOST: A dual-branch hybrid network for medical image fusion via efficient model module and sparse transformer

Multi-Modal Transformer for Accelerated MR Imaging

Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer

FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba

AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention

Multimodal Token Fusion for Vision Transformers

TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers

TransFusion-net for multifocus microscopic biomedical image fusion

ICAFusion: Iterative cross-attention guided feature fusion for multispectral object detection

Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification

MMMViT: Multiscale multimodal vision transformer for brain tumor segmentation with missing modalities

Adaptive spatial and frequency experts fusion network for medical image fusion