Abstract: Background: Multimodal magnetic resonance imaging (MRI) sequences carry complementary information. Combining multiple modalities for brain tumor image segmentation can improve segmentation accuracy, which is of great significance for disease diagnosis and treatment. However, modality data are often missing to varying degrees in clinical practice, which can cause serious performance degradation, or even outright failure, in brain tumor segmentation methods that rely on full-modality sequences. To address this problem, this study aimed to design a new deep learning network for incomplete multimodal brain tumor segmentation. Methods: We propose a novel cross-modal attention fusion-based deep neural network (CMAF-Net) for incomplete multimodal brain tumor segmentation, which is based on a three-dimensional (3D) U-Net architecture with an encoder-decoder structure, a 3D Swin block, and a cross-modal attention fusion (CMAF) block. A convolutional encoder is first used to extract modality-specific features, and a 3D Swin block is constructed to model long-range dependencies and obtain richer information for brain tumor segmentation. Then, a cross-attention-based CMAF module is proposed that handles different missing-modality situations by fusing features across modalities to learn shared representations of the tumor regions. Finally, the fused latent representation is decoded to obtain the final segmentation result. Additionally, a channel attention module (CAM) and a spatial attention module (SAM) are incorporated into the network to further improve the robustness of the model: the CAM helps the network focus on important feature channels, and the SAM learns the importance of different spatial regions.
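
As a rough illustration of how attention-based fusion can tolerate missing modalities, the sketch below fuses per-modality token features using only the modalities flagged as available. It is not the CMAF block itself: the function name `cross_modal_fusion`, the use of the mean of available features as the query, and the absence of learned projection weights are all simplifying assumptions made for this example.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_modal_fusion(features, available):
    """Fuse token features from the available modalities (hypothetical helper).

    features:  array of shape (M, N, C), i.e. M modalities, N tokens, C channels.
    available: length-M boolean mask; False marks a missing modality.
    Returns an (N, C) array of fused tokens.
    """
    features = np.asarray(features, dtype=float)
    available = np.asarray(available, dtype=bool)
    if not available.any():
        raise ValueError("at least one modality must be available")

    # Drop missing modalities so their contents can never influence the result.
    present = features[available]              # (K, N, C)
    k, n, c = present.shape

    # Assumption: the query is the mean of the available modality features.
    query = present.mean(axis=0)               # (N, C)

    # Keys and values are the tokens of every available modality.
    keys = present.reshape(k * n, c)           # (K*N, C)

    # Scaled dot-product attention over all available tokens.
    weights = softmax(query @ keys.T / np.sqrt(c), axis=-1)  # (N, K*N)
    return weights @ keys                      # (N, C)
```

Because missing modalities are removed before the attention weights are computed, the fused output has the same shape for any subset of inputs, which is the property a shared decoder would need.
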
Results: Evaluation experiments on the widely used BraTS 2018 and BraTS 2020 datasets demonstrated the effectiveness of the proposed CMAF-Net, which achieved average Dice scores of 87.9%, 81.8%, and 64.3%, as well as Hausdorff distances of 4.21, 5.35, and 4.02, for whole tumor, tumor core, and enhancing tumor on the BraTS 2020 dataset, respectively, outperforming several state-of-the-art segmentation methods in missing-modality situations. Conclusions: The experimental results show that the proposed CMAF-Net can achieve accurate brain tumor segmentation when modalities are missing and has promising application potential.
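
For readers unfamiliar with the Dice metric reported above, the following minimal sketch computes it for the three tumor regions from integer label maps. It assumes the usual BraTS label convention (1 = necrotic/non-enhancing core, 2 = edema, 4 = enhancing tumor), which the abstract does not state, and the names `dice_score`, `REGION_LABELS`, and `region_dice` are hypothetical. It is not the evaluation code used in the study.

```python
import numpy as np

# Assumed BraTS label convention; not stated in the abstract.
REGION_LABELS = {
    "whole_tumor": (1, 2, 4),
    "tumor_core": (1, 4),
    "enhancing_tumor": (4,),
}


def dice_score(pred_mask, target_mask):
    """Dice = 2 * |P & T| / (|P| + |T|) for two binary masks."""
    pred_mask = np.asarray(pred_mask, dtype=bool)
    target_mask = np.asarray(target_mask, dtype=bool)
    denom = pred_mask.sum() + target_mask.sum()
    if denom == 0:
        # Assumption: two empty masks count as a perfect match.
        return 1.0
    return 2.0 * np.logical_and(pred_mask, target_mask).sum() / denom


def region_dice(pred_labels, target_labels):
    """Return the Dice score of each tumor region for two label maps."""
    pred_labels = np.asarray(pred_labels)
    target_labels = np.asarray(target_labels)
    return {
        name: dice_score(np.isin(pred_labels, labels),
                         np.isin(target_labels, labels))
        for name, labels in REGION_LABELS.items()
    }
```

A call such as `region_dice(prediction, ground_truth)` returns one score per region; averaging these over cases gives figures comparable in kind to the percentages quoted above.
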

Feature-enhanced generation and multi-modality fusion based deep neural network for brain tumor segmentation with missing MR modalities

Feature fusion and latent feature learning guided brain tumor segmentation and missing modality recovery network

Conditional generator and multi-source correlation guided brain tumor segmentation with missing MR modalities

Modality-level cross-connection and attentional feature fusion based deep neural network for multi-modal brain tumor segmentation

Latent Correlation Representation Learning for Brain Tumor Segmentation with Missing MRI Modalities

Mixture-of-experts and semantic-guided network for brain tumor segmentation with missing MRI modalities

A multi-modality fusion network based on attention mechanism for brain tumor segmentation

Multi-Modality Brain Tumor Segmentation Network Based on Collaborative Feature Fusion

M3AE: Multimodal Representation Learning for Brain Tumor Segmentation with Missing Modalities

MSFR-Net: Multi-modality and single-modality feature recalibration network for brain tumor segmentation

Multi-modal Brain Tumor Segmentation via Missing Modality Synthesis and Modality-level Attention Fusion

Brain Tumor Segmentation Network Using Attention-based Fusion and Spatial Relationship Constraint

Brain Tumor Segmentation on MRI with Missing Modalities

CMAF-Net: a cross-modal attention fusion-based deep neural network for incomplete multi-modal brain tumor segmentation

Brain Tumor Segmentation in Multimodal MRI Via Pixel-Level and Feature-Level Image Fusion

Brain tumor image segmentation algorithm based on multimodal feature fusion of Bayesian weight distribution

Brain tumor segmentation by combining MultiEncoder UNet with wavelet fusion

Learning rich features with hybrid loss for brain tumor segmentation

MM-UNet: A multimodality brain tumor segmentation network in MRI images

Multi-modal brain tumor segmentation via disentangled representation learning and region-aware contrastive learning

Brain tumor segmentation based on the fusion of deep semantics and edge information in multimodal MRI