Abstract: Medical imaging is indispensable for accurate diagnosis and effective treatment, and modalities such as MRI and CT provide diverse yet complementary information. Traditional image fusion methods consolidate information from multiple modalities, but they often produce poor image quality and lose crucial details because they handle semantic information inadequately and have limited feature extraction capability. This paper introduces DUSMIF, a medical image fusion method that uses unsupervised image segmentation to improve the semantic understanding of the fusion process. The method makes four contributions. First, semantic information is extracted through unsupervised image segmentation and integrated into the fusion process, which improves both the semantic relevance of the fused images and the overall fusion quality. Second, a multi-branch, multi-scale network extracts and fuses features at multiple scales and across multiple branches, capturing a broad range of image detail and contextual information. Third, multiple attention mechanisms selectively emphasize important features and integrate them across modalities and scales, so that the fused images retain fine detail. Fourth, a joint loss function combining content loss, structural similarity loss, and semantic loss guides the network to preserve image brightness and texture while keeping the fused image close to the source images in both content and structure. In objective assessments and subjective evaluations, the proposed method outperforms existing fusion techniques, confirming its effectiveness in enhancing the diagnostic utility of fused medical images.
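The abstract names the three terms of the joint loss but not their exact form. The following is a minimal NumPy sketch of how such a loss could be assembled, not the DUSMIF implementation. Everything specific in it is an assumption made for illustration: the function names, the equal default weights, the element-wise maximum used as the intensity and gradient target, the single-window (global) SSIM, and the cross-entropy between segmentation probability maps used as the semantic term.

```python
import numpy as np


def global_ssim(x, y, c1=0.01 ** 2, c2=0.03 ** 2):
    """SSIM computed over the whole image as one window (inputs in [0, 1])."""
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / (
        (mx ** 2 + my ** 2 + c1) * (vx + vy + c2)
    )


def gradient_magnitude(img):
    """L1 gradient magnitude, used here as a simple texture measure."""
    gy, gx = np.gradient(img)
    return np.abs(gx) + np.abs(gy)


def joint_loss(fused, src_a, src_b, seg_fused, seg_ref,
               w_content=1.0, w_ssim=1.0, w_sem=1.0):
    """Weighted sum of content, structural similarity, and semantic terms.

    fused, src_a, src_b: 2-D arrays scaled to [0, 1].
    seg_fused, seg_ref: (H, W, K) per-pixel class probabilities
    (hypothetical outputs of a segmentation model).
    """
    # Content term (assumed form): match the brighter source pixel and
    # the stronger source gradient.
    intensity = np.abs(fused - np.maximum(src_a, src_b)).mean()
    texture = np.abs(
        gradient_magnitude(fused)
        - np.maximum(gradient_magnitude(src_a), gradient_magnitude(src_b))
    ).mean()
    content = intensity + texture

    # Structural term: 1 minus the mean SSIM against both sources.
    ssim = 1.0 - 0.5 * (global_ssim(fused, src_a) + global_ssim(fused, src_b))

    # Semantic term (assumed form): per-pixel cross-entropy between the
    # reference segmentation and the segmentation of the fused image.
    eps = 1e-8
    semantic = -(seg_ref * np.log(seg_fused + eps)).sum(axis=-1).mean()

    return w_content * content + w_ssim * ssim + w_sem * semantic
```

In a trainable setting the same structure would be written with a differentiable framework and a windowed SSIM; the sketch only shows how the three terms combine into one scalar.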
