Few-Shot Learning for Medical Image Segmentation Using 3D U-Net and Model-Agnostic Meta-Learning (MAML)

Aqilah M. Alsaleh,Eid Albalawi,Abdulelah Algosaibi,Salman S. Albakheet,Surbhi Bhatia Khan
DOI: https://doi.org/10.3390/diagnostics14121213
IF: 3.6
2024-06-08
Diagnostics
Abstract:Deep learning has attained state-of-the-art results in general image segmentation problems; however, it requires a substantial number of annotated images to achieve the desired outcomes. In the medical field, the availability of annotated images is often limited. To address this challenge, few-shot learning techniques have been successfully adapted to rapidly generalize to new tasks with only a few samples, leveraging prior knowledge. In this paper, we employ a gradient-based method known as Model-Agnostic Meta-Learning (MAML) for medical image segmentation. MAML is a meta-learning algorithm that quickly adapts to new tasks by updating a model's parameters based on a limited set of training samples. Additionally, we use an enhanced 3D U-Net as the foundational network for our models. The enhanced 3D U-Net is a convolutional neural network specifically designed for medical image segmentation. We evaluate our approach on the TotalSegmentator dataset, considering a few annotated images for four tasks: liver, spleen, right kidney, and left kidney. The results demonstrate that our approach facilitates rapid adaptation to new tasks using only a few annotated images. In 10-shot settings, our approach achieved mean dice coefficients of 93.70%, 85.98%, 81.20%, and 89.58% for liver, spleen, right kidney, and left kidney segmentation, respectively. In five-shot sittings, the approach attained mean Dice coefficients of 90.27%, 83.89%, 77.53%, and 87.01% for liver, spleen, right kidney, and left kidney segmentation, respectively. Finally, we assess the effectiveness of our proposed approach on a dataset collected from a local hospital. Employing five-shot sittings, we achieve mean Dice coefficients of 90.62%, 79.86%, 79.87%, and 78.21% for liver, spleen, right kidney, and left kidney segmentation, respectively.
medicine, general & internal
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to address the issue of data scarcity in medical image segmentation. Specifically: 1. **Background and Challenges**: - Deep learning has achieved significant results in general image segmentation tasks but requires a large number of annotated images to achieve the desired effect. - In the medical field, the number of annotated images is usually limited. - To tackle this challenge, researchers attempt to apply few-shot learning techniques to medical image segmentation, allowing for quick adaptation to new tasks with a small number of samples. 2. **Methods and Innovations**: - The paper adopts a gradient-based method—Model-Agnostic Meta-Learning (MAML) to quickly adapt to new segmentation tasks. - An enhanced 3D U-Net is used as the base network architecture, specifically designed for medical image segmentation. - The MAML meta-learning algorithm can quickly adapt to new tasks by updating model parameters, requiring only a few training samples to achieve this. 3. **Experimental Validation**: - Experiments were conducted on the TotalSegmentator dataset, evaluating segmentation for four tasks (liver, spleen, right kidney, and left kidney). - In the 10-shot setting, the average Dice coefficients achieved were 93.70%, 85.98%, 81.20%, and 89.58%, respectively. - In the 5-shot setting, the average Dice coefficients achieved were 90.27%, 83.89%, 77.53%, and 87.01%, respectively. - Further evaluation was conducted on a dataset collected from a local hospital, achieving average Dice coefficients of 90.62%, 79.86%, 79.87%, and 78.21% in the 5-shot setting. ### Summary This paper proposes a method combining MAML and an enhanced 3D U-Net to address the issue of data scarcity in medical image segmentation, achieving high-precision segmentation results with a small number of samples. This method performs excellently across multiple tasks and demonstrates its effectiveness in practical application scenarios.