3D Medical Image Segmentation based on multi-scale MPU-Net

Zeqiu.Yu,Shuo.Han,Ziheng.Song
2023-07-25
Abstract:The high cure rate of cancer is inextricably linked to physicians' accuracy in diagnosis and treatment, therefore a model that can accomplish high-precision tumor segmentation has become a necessity in many applications of the medical industry. It can effectively lower the rate of misdiagnosis while considerably lessening the burden on clinicians. However, fully automated target organ segmentation is problematic due to the irregular stereo structure of 3D volume organs. As a basic model for this class of real applications, U-Net excels. It can learn certain global and local features, but still lacks the capacity to grasp spatial long-range relationships and contextual information at multiple scales. This paper proposes a tumor segmentation model MPU-Net for patient volume CT images, which is inspired by Transformer with a global attention mechanism. By combining image serialization with the Position Attention Module, the model attempts to comprehend deeper contextual dependencies and accomplish precise positioning. Each layer of the decoder is also equipped with a multi-scale module and a cross-attention mechanism. The capability of feature extraction and integration at different levels has been enhanced, and the hybrid loss function developed in this study can better exploit high-resolution characteristic information. Moreover, the suggested architecture is tested and evaluated on the Liver Tumor Segmentation Challenge 2017 (LiTS 2017) dataset. Compared with the benchmark model U-Net, MPU-Net shows excellent segmentation results. The dice, accuracy, precision, specificity, IOU, and MCC metrics for the best model segmentation results are 92.17%, 99.08%, 91.91%, 99.52%, 85.91%, and 91.74%, respectively. Outstanding indicators in various aspects illustrate the exceptional performance of this framework in automatic medical image segmentation.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper mainly addresses the challenges in medical image segmentation, particularly the high-precision automatic segmentation of tumors in 3D medical imaging. The authors propose a new model, MPU-Net, aimed at improving the existing U-Net architecture to better handle three-dimensional volumetric data and enhance the accuracy of tumor segmentation. The specific problems addressed in the paper include: 1. **Improving the effectiveness of medical image semantic segmentation**: By proposing a new 3D medical image model (MPU-Net) based on the U-Net framework to improve segmentation accuracy. 2. **Effectively capturing long-range dependency information**: Utilizing the Position Attention Module (PAM) in volumetric medical images to capture long-range dependency information. 3. **Reconstructing the U-Net nested network**: By sampling the self-attention features of PAM and combining them with CNN features of different resolutions to generate the predicted segmentation output. 4. **Enhancing the efficiency of long-term sequence feature retrieval**: Replacing traditional fast convolution layers with a Hierarchical Convolutional Neural Network (HCNN) to improve the efficiency of long-term sequence feature retrieval. 5. **Developing a new loss function**: Proposing a new loss function that combines Tversky loss and cross-entropy, which continues to decrease even after the model converges. 6. **Multi-scale information fusion**: Introducing a cross-attention mechanism based on multi-scale blocks and adopting multi-scale output to achieve the fusion of feature information at different resolutions. The main contribution of the paper is the proposal of a novel framework, MPU-Net, which combines position attention mechanisms and multi-scale attention mechanisms to improve tumor segmentation in 3D medical images. Additionally, a series of experiments validate the model's advantages in segmentation performance compared to the baseline model U-Net.