Brain Tumor Segmentation in MRI Images with 3D U-Net and Contextual Transformer

Thien-Qua T. Nguyen,Hieu-Nghia Nguyen,Thanh-Hieu Bui,Thien B. Nguyen-Tat,Vuong M. Ngo
2024-07-11
Abstract:This research presents an enhanced approach for precise segmentation of brain tumor masses in magnetic resonance imaging (MRI) using an advanced 3D-UNet model combined with a Context Transformer (CoT). By architectural expansion CoT, the proposed model extends its architecture to a 3D format, integrates it smoothly with the base model to utilize the complex contextual information found in MRI scans, emphasizing how elements rely on each other across an extended spatial range. The proposed model synchronizes tumor mass characteristics from CoT, mutually reinforcing feature extraction, facilitating the precise capture of detailed tumor mass structures, including location, size, and boundaries. Several experimental results present the outstanding segmentation performance of the proposed method in comparison to current state-of-the-art approaches, achieving Dice score of 82.0%, 81.5%, 89.0% for Enhancing Tumor, Tumor Core and Whole Tumor, respectively, on BraTS2019.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of accurate segmentation of brain tumors in Magnetic Resonance Imaging (MRI). Specifically, the study proposes an improved method that combines the 3D U-Net model with the Context Transformer (CoT) to enhance the precision of brain tumor region segmentation. By extending the CoT architecture to a 3D format and smoothly integrating it with the base model, this method can leverage the complex contextual information in MRI scans, emphasizing long-distance dependencies between different elements. Additionally, the model achieves detailed capture of tumor structures (including location, size, and boundaries) by synchronizing tumor features extracted by CoT, mutually enhancing feature extraction. ### Main Contributions 1. **3D Context Transformer (CoT)**: Extends the 2D Context Transformer to a 3D format, combined with the 3D U-Net model, utilizing rich contextual information. 2. **Improved Segmentation Performance**: Experimental results show that this method achieves excellent segmentation performance on the BraTS2019 dataset, with Dice scores of 82.0%, 81.5%, and 89.0% for Enhancing Tumor, Tumor Core, and Whole Tumor labels, respectively. ### Background and Motivation Brain tumors are a serious disease that affects patients' health and quality of life. Traditional imaging methods such as X-rays and MRI can detect brain tumors but cannot provide detailed tumor information. Therefore, using modern diagnostic methods, especially artificial intelligence technology, to identify and classify brain tumors has become particularly important. Automating this process not only reduces costs and time but also alleviates the burden on staff and the healthcare system, improving efficiency and resource utilization. ### Methods 1. **3D U-Net Model**: A commonly used neural network for 3D medical image processing, analyzing spatial information through down-sampling and up-sampling layers to achieve high-precision 3D segmentation. 2. **3D Context Transformer (CoT)**: Combines self-attention mechanisms and contextual information, effectively supporting the self-attention learning process and enhancing the representation capability of the output feature map. 3. **Loss Function**: Uses a weighted combination of Dice loss and cross-entropy loss to optimize model parameters. ### Experimental Results 1. **Ablation Study**: Experimental results show that the model combined with CoT significantly reduces segmentation errors in all tumor regions, particularly improving the Dice score in the Enhancing Tumor region by 5.6%. 2. **Modality Impact Assessment**: Analysis of the impact of different modalities on model performance indicates that the T1c modality is crucial for the segmentation performance of the Tumor Core and Enhancing Tumor regions, while the FLAIR modality significantly affects the segmentation performance of the Whole Tumor region. 3. **Comparison with Existing Methods**: Compared to current state-of-the-art methods, this model performs excellently on the BraTS2019 validation set, particularly achieving a Dice score of 82.0% in the Enhancing Tumor region. ### Conclusion and Future Work This study proposes a powerful multi-modal brain tumor segmentation technique that improves segmentation accuracy by combining 3D U-Net and CoT. Experimental results validate the effectiveness of this method. Future research will focus on optimizing computational resources, improving preprocessing techniques, and extending to other medical image segmentation tasks such as liver fibrosis, hepatitis, and lung lesions.