Multimodal MRI brain tumor segmentation using 3D attention UNet with dense encoder blocks and residual decoder blocks

DOI: https://doi.org/10.1007/s11042-024-18942-1
IF: 2.577
2024-04-24
Multimedia Tools and Applications
Abstract:Medical image segmentation is essential for disease diagnosis and for supporting medical decision systems. Automatic segmentation of brain tumors from Magnetic Resonance Imaging (MRI) is crucial for treatment planning and timely diagnosis. Due to the enormous amount of data that MRI provides as well as the variability in the location and size of the tumor, automatic segmentation is a difficult process. Consequently, a current outstanding problem in the field of deep learning-based medical image analysis is the development of an accurate and trustworthy way to separate the tumorous region from healthy tissues. In this paper, we propose a 3D Attention U-Net with dense encoder blocks and residual decoder blocks, which combines the benefits of both DenseNet and ResNet. Dense blocks with transition layers help to strengthen feature propagation, reduce vanishing gradient, and increase the receptive field. Because each layer receives feature maps from all previous layers, the network can be made thinner and more compact. To make predictions, it considers both low-level and high-level features at the same time. In addition, shortcut connections between the residual network are used to preserve low-level features at each level. As part of the proposed architecture, skip connections between dense and residual blocks are utilized along with an attention layer to speed up the training process. The proposed architecture was trained and validated using BraTS 2020 dataset, it showed promising results with dice scores of 0.866, 0.889, and 0.828 for the tumor core (TC), whole tumor (WT), and enhancing tumor (ET), respectively. In comparison to the original 3D UNet, our approach performs better. According to the findings of our experiment, our approach is a competitive automatic brain tumor segmentation method when compared to some state-of-the-art techniques and can be applied to clinically challenging medical imaging problems and other segmentation applications.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?
The problem this paper attempts to address is improving the accuracy of automatic brain tumor segmentation in multi-modal magnetic resonance imaging (MRI) using a 3D Attention U-Net architecture. Specifically, the researchers propose a 3D Attention U-Net model that combines the advantages of DenseNet and ResNet, aiming to overcome the limitations of existing methods and enhance the accuracy of brain tumor region (ROI) segmentation. The main contributions include: 1. **Dense Encoder and Residual Decoder**: Combining the strengths of DenseNet and ResNet, the model enhances feature propagation through dense blocks and transition layers, and retains low-level features through residual blocks. This effectively addresses the vanishing gradient problem and increases the receptive field. 2. **3D Spatial and Channel Attention Module**: Introducing a 3D spatial and channel attention module to improve the network's ability to focus on important regions, thereby making more precise predictions while reducing computational overhead. 3. **Skip Connections**: Using skip connections between residual blocks and between the dense encoder and residual decoder to ensure that low-level features are retained at each level, thereby improving the quality of the segmentation results. The researchers trained and validated the proposed model using the BraTS 2020 dataset and achieved excellent results, with Dice scores of 0.866 for tumor core (TC), 0.889 for whole tumor (WT), and 0.828 for enhancing tumor (ET). These results indicate that the method is competitive with other state-of-the-art methods and can be applied to clinically challenging medical imaging problems and other segmentation tasks.