Using DUCK-Net for Polyp Image Segmentation

Razvan-Gabriel Dumitru,Darius Peteleaza,Catalin Craciun
DOI: https://doi.org/10.1038/s41598-023-36940-5
2023-11-04
Abstract:This paper presents a novel supervised convolutional neural network architecture, "DUCK-Net", capable of effectively learning and generalizing from small amounts of medical images to perform accurate segmentation tasks. Our model utilizes an encoder-decoder structure with a residual downsampling mechanism and a custom convolutional block to capture and process image information at multiple resolutions in the encoder segment. We employ data augmentation techniques to enrich the training set, thus increasing our model's performance. While our architecture is versatile and applicable to various segmentation tasks, in this study, we demonstrate its capabilities specifically for polyp segmentation in colonoscopy images. We evaluate the performance of our method on several popular benchmark datasets for polyp segmentation, Kvasir-SEG, CVC-ClinicDB, CVC-ColonDB, and ETIS-LARIBPOLYPDB showing that it achieves state-of-the-art results in terms of mean Dice coefficient, Jaccard index, Precision, Recall, and Accuracy. Our approach demonstrates strong generalization capabilities, achieving excellent performance even with limited training data. The code is publicly available on GitHub: <a class="link-external link-https" href="https://github.com/RazvanDu/DUCK-Net" rel="external noopener nofollow">this https URL</a>
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve high - precision segmentation effects on a limited data set in medical image segmentation tasks, especially in polyp segmentation in colonoscopy images. Specifically, the authors propose a new supervised convolutional neural network architecture - DUCK - Net, aiming to improve the existing polyp segmentation methods through the following points: 1. **Reduce the dependence on a large amount of labeled data**: Traditional deep - learning algorithms usually require a large amount of labeled data for training, which is especially difficult in the medical field because high - quality labeled data is costly and time - consuming to obtain. DUCK - Net can effectively learn and generalize on small - scale data sets by using the residual down - sampling mechanism and custom convolutional blocks. 2. **Improve segmentation accuracy**: DUCK - Net utilizes an encoder - decoder structure and combines multiple convolutional blocks (such as residual blocks, medium - field - of - view blocks, wide - field - of - view blocks, and separable blocks) to capture and process image information at multiple resolutions, thereby improving the ability to accurately predict polyp boundaries. 3. **Enhance the generalization ability of the model**: Through data augmentation techniques, DUCK - Net can extract more features from limited training data, thereby improving the generalization performance of the model on different data sets. Experimental results show that DUCK - Net has achieved state - of - the - art results on multiple benchmark data sets (such as Kvasir - SEG, CVC - ClinicDB, CVC - ColonDB, and ETIS - LARIBPOLYPDB), especially performing excellently in metrics such as the average Dice coefficient, Jaccard index, precision, recall, and accuracy. 4. **No pre - training required**: DUCK - Net does not use any external modules and is only trained on the target data set. It does not require pre - training, which further simplifies the model training process and reduces the dependence on large - scale pre - training data. In conclusion, the main contribution of this paper lies in proposing an innovative convolutional neural network architecture that can achieve high - precision polyp segmentation on a limited data set while having good generalization ability and robustness.