Deep Semantic Segmentation of Natural and Medical Images: A Review

Saeid Asgari Taghanaki, Kumar Abhishek, Joseph Paul Cohen, Julien Cohen-Adad, Ghassan Hamarneh
2024-03-31
Abstract: The semantic image segmentation task consists of classifying each pixel of an image into an instance, where each instance corresponds to a class. This task is a part of the concept of scene understanding or better explaining the global context of an image. In the medical image analysis domain, image segmentation can be used for image-guided interventions, radiotherapy, or improved radiological diagnostics. In this review, we categorize the leading deep learning-based medical and non-medical image segmentation solutions into six main groups of deep architectural, data synthesis-based, loss function-based, sequenced models, weakly supervised, and multi-task methods and provide a comprehensive review of the contributions in each of these groups. Further, for each group, we analyze each variant of these groups and discuss the limitations of the current approaches and present potential future research directions for semantic image segmentation.
Computer Vision and Pattern Recognition, Machine Learning, Image and Video Processing
What problem does this paper attempt to address?
This paper reviews deep learning-based methods for semantic segmentation of natural and medical images, categorizing them systematically and discussing each category. Its main objectives are:

1. **Comprehensive Coverage of Research Contributions**: It provides a comprehensive overview of research on semantic segmentation of natural and medical images. Special attention is given to medical imaging modalities, including 2D (RGB and grayscale) and 3D volumetric medical images.
2. **Method Classification**: The literature is divided into six categories: architecture improvements, optimization (loss) function improvements, data synthesis improvements, weakly supervised models, sequential models, and multi-task models, with a detailed analysis of each category.
3. **Study of Loss Function Behavior**: It analyzes how various popular loss functions behave under different levels of false positive and false negative predictions.
4. **Future Research Directions**: Based on the review, it identifies important future research directions within each category.

The paper first introduces the basic definition and importance of the semantic segmentation task, then discusses progress in each group of methods in detail. For example, in the **network architecture improvements** section, the paper traces the development from fully convolutional networks (FCNs) to encoder-decoder models (such as U-Net and V-Net), and then to more complex structures (such as DeepLabV3+). It also explores how techniques like attention mechanisms and adversarial training can further enhance model performance.

In the **medical images** section, the paper emphasizes the role of model compression, attention mechanisms, and adversarial training in improving the accuracy of medical image segmentation. These methods are particularly important when dealing with high-resolution or three-dimensional images, as they help achieve real-time processing and reduce memory consumption.

Overall, this review aims to give readers a comprehensive perspective on the application of deep learning to semantic segmentation of natural and medical images, and to offer guidance for future research.
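To make the point about loss functions and false positives versus false negatives more concrete, here is a minimal NumPy sketch. The summary above does not name the specific losses studied, so the Dice and Tversky losses are used here only as well-known examples; the function names, the `alpha`/`beta` weights, and the toy masks are all assumptions made for illustration, not the paper's experimental setup.

```python
import numpy as np


def confusion_counts(pred, target):
    """Return (soft) true-positive, false-positive and false-negative counts."""
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    tp = float(np.sum(pred * target))
    fp = float(np.sum(pred * (1.0 - target)))
    fn = float(np.sum((1.0 - pred) * target))
    return tp, fp, fn


def tversky_loss(pred, target, alpha=0.5, beta=0.5, eps=1e-7):
    """Tversky loss: alpha weights false positives, beta weights false negatives."""
    tp, fp, fn = confusion_counts(pred, target)
    return 1.0 - (tp + eps) / (tp + alpha * fp + beta * fn + eps)


def dice_loss(pred, target, eps=1e-7):
    """Dice loss, which equals the Tversky loss with alpha = beta = 0.5."""
    return tversky_loss(pred, target, alpha=0.5, beta=0.5, eps=eps)


# Hypothetical 4x4 ground truth with a 2x2 foreground square.
target = np.zeros((4, 4))
target[1:3, 1:3] = 1.0

# Over-segmentation: the full object plus two extra pixels (2 false positives).
over = target.copy()
over[0, 1:3] = 1.0

# Under-segmentation: only half of the object is found (2 false negatives).
under = np.zeros((4, 4))
under[1, 1:3] = 1.0

for name, pred in [("over-segmented", over), ("under-segmented", under)]:
    print(
        name,
        "dice:", round(dice_loss(pred, target), 4),
        "tversky(alpha=0.3, beta=0.7):",
        round(tversky_loss(pred, target, alpha=0.3, beta=0.7), 4),
    )
```

In this sketch, choosing `beta > alpha` penalizes the under-segmented mask more heavily than Dice does and the over-segmented mask less heavily, which is the kind of trade-off that matters when missed foreground (for example, a small lesion) is costlier than a few spurious pixels.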