Semantic Polyp Generation for Improving Polyp Segmentation Performance

Hun Song, Younghak Shin
DOI: https://doi.org/10.1007/s40846-024-00854-y
IF: 2
2024-03-23
Journal of Medical and Biological Engineering
Abstract: Deep-learning-based image segmentation requires a sufficient amount of training data to perform well. However, training images and segmentation masks are more difficult to obtain for medical images than for general images. In deep-learning-based colon polyp detection and segmentation, recent research has sought to improve performance by generating polyp images with a generative model and adding them to the training data.
engineering, biomedical
What problem does this paper attempt to address?
The paper addresses the shortage of training data for deep-learning-based polyp segmentation in colonoscopy images. Specifically, the research focuses on the following aspects:

1. **Difficulty in Data Collection**: Medical images, such as colonoscopy polyp images, are harder to obtain than general images, mainly because of privacy protection and restrictions on the use of personal medical data.
2. **High Annotation Cost**: Even when sufficient data is available, professional experts must annotate the polyp masks, which takes considerable time and money.
3. **Lack of Data Diversity**: Existing public datasets do not cover the full diversity of polyps, which limits the performance of deep learning models.

To overcome these challenges, the paper proposes the SemanticPolypGAN model, which generates synthetic colonoscopy polyp images that are added to the training dataset to improve polyp segmentation performance. Compared to existing methods, SemanticPolypGAN has the following features:

- **No Additional Input Preparation**: It can generate polyp images and the corresponding mask images without any additional input-condition preparation steps.
- **Independent Control Capability**: It can independently control the shape and texture of the polyp part and the non-polyp part (i.e., the intestinal surface).
- **Diversity and Quality**: By randomly modifying the latent vectors of the generated polyp images, it can control the shape and texture of the polyps, thereby generating diverse images.

Experimental results show that, across different polyp segmentation models (such as UACANet and PraNet), adding images generated by SemanticPolypGAN to the training data improves overall performance. In addition, images generated with this model improve segmentation performance more effectively than those from previous methods.
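The two ideas summarized above, varying only the polyp latent vector while keeping the background fixed, and mixing generated image/mask pairs into the real training set, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the generator itself is omitted, and the function names (`perturb_latent`, `make_variants`, `augment`), the Gaussian perturbation, the `scale` value, and the `ratio` parameter are all assumptions introduced here for illustration only.

```python
import random


def perturb_latent(z, scale, rng):
    """Return a copy of latent vector z with Gaussian noise added to each component."""
    return [v + rng.gauss(0.0, scale) for v in z]


def make_variants(z_polyp, z_background, n, scale=0.5, seed=0):
    """Build n (polyp_code, background_code) pairs.

    Only the polyp code is perturbed; the background code is copied unchanged,
    mimicking independent control of the polyp and non-polyp regions.
    (Hypothetical helper; a real generator would consume these codes.)
    """
    rng = random.Random(seed)
    return [
        (perturb_latent(z_polyp, scale, rng), list(z_background))
        for _ in range(n)
    ]


def augment(real_pairs, synthetic_pairs, ratio, seed=0):
    """Append synthetic (image, mask) pairs to the real training pairs.

    At most ratio * len(real_pairs) synthetic pairs are added, sampled
    without replacement. The ratio is an assumed tuning knob, not a value
    taken from the paper.
    """
    rng = random.Random(seed)
    k = min(len(synthetic_pairs), int(ratio * len(real_pairs)))
    return list(real_pairs) + rng.sample(list(synthetic_pairs), k)


if __name__ == "__main__":
    variants = make_variants([0.0] * 4, [1.0] * 4, n=3)
    print(len(variants), "latent variants share one background code")

    real = [("real_img_%d" % i, "real_mask_%d" % i) for i in range(10)]
    fake = [("gen_img_%d" % i, "gen_mask_%d" % i) for i in range(20)]
    print(len(augment(real, fake, ratio=0.5)), "training pairs after augmentation")
```

In practice each latent pair would be passed to a trained generator to obtain an image and its mask, and the proportion of synthetic data would be chosen by validating the downstream segmentation model.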