Abstract:This paper presents a novel supervised convolutional neural network architecture, "DUCK-Net", capable of effectively learning and generalizing from small amounts of medical images to perform accurate segmentation tasks. Our model utilizes an encoder-decoder structure with a residual downsampling mechanism and a custom convolutional block to capture and process image information at multiple resolutions in the encoder segment. We employ data augmentation techniques to enrich the training set, thus increasing our model's performance. While our architecture is versatile and applicable to various segmentation tasks, in this study, we demonstrate its capabilities specifically for polyp segmentation in colonoscopy images. We evaluate the performance of our method on several popular benchmark datasets for polyp segmentation, Kvasir-SEG, CVC-ClinicDB, CVC-ColonDB, and ETIS-LARIBPOLYPDB showing that it achieves state-of-the-art results in terms of mean Dice coefficient, Jaccard index, Precision, Recall, and Accuracy. Our approach demonstrates strong generalization capabilities, achieving excellent performance even with limited training data. The code is publicly available on GitHub: <a class="link-external link-https" href="https://github.com/RazvanDu/DUCK-Net" rel="external noopener nofollow">this https URL</a>

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to achieve high - precision segmentation effects on a limited data set in medical image segmentation tasks, especially in polyp segmentation in colonoscopy images. Specifically, the authors propose a new supervised convolutional neural network architecture - DUCK - Net, aiming to improve the existing polyp segmentation methods through the following points: 1. **Reduce the dependence on a large amount of labeled data**: Traditional deep - learning algorithms usually require a large amount of labeled data for training, which is especially difficult in the medical field because high - quality labeled data is costly and time - consuming to obtain. DUCK - Net can effectively learn and generalize on small - scale data sets by using the residual down - sampling mechanism and custom convolutional blocks. 2. **Improve segmentation accuracy**: DUCK - Net utilizes an encoder - decoder structure and combines multiple convolutional blocks (such as residual blocks, medium - field - of - view blocks, wide - field - of - view blocks, and separable blocks) to capture and process image information at multiple resolutions, thereby improving the ability to accurately predict polyp boundaries. 3. **Enhance the generalization ability of the model**: Through data augmentation techniques, DUCK - Net can extract more features from limited training data, thereby improving the generalization performance of the model on different data sets. Experimental results show that DUCK - Net has achieved state - of - the - art results on multiple benchmark data sets (such as Kvasir - SEG, CVC - ClinicDB, CVC - ColonDB, and ETIS - LARIBPOLYPDB), especially performing excellently in metrics such as the average Dice coefficient, Jaccard index, precision, recall, and accuracy. 4. **No pre - training required**: DUCK - Net does not use any external modules and is only trained on the target data set. It does not require pre - training, which further simplifies the model training process and reduces the dependence on large - scale pre - training data. In conclusion, the main contribution of this paper lies in proposing an innovative convolutional neural network architecture that can achieve high - precision polyp segmentation on a limited data set while having good generalization ability and robustness.

Using DUCK-Net for Polyp Image Segmentation

BetterNet: An Efficient CNN Architecture with Residual Learning and Attention for Precision Polyp Segmentation

ECTransNet: An Automatic Polyp Segmentation Network Based on Multi-scale Edge Complementary

DDANet: Dual Decoder Attention Network for Automatic Polyp Segmentation

Automatic Polyp Segmentation in Colonoscopy Images Using a Modified Deep Convolutional Encoder-Decoder Architecture

Multi parallel U-net encoder network for effective polyp image segmentation

Efficient colorectal polyp segmentation using wavelet transformation and AdaptUNet: A hybrid U-Net

An Efficient Polyp Segmentation Network

An approach of polyp segmentation from colonoscopy images using Dilated-U-Net-Seg – A deep learning network

Multi‐scale nested UNet with transformer for colorectal polyp segmentation

TransRUPNet for Improved Polyp Segmentation

Dual-branch multi-information aggregation network with transformer and convolution for polyp segmentation

Dual encoder–decoder-based deep polyp segmentation network for colonoscopy images

Learnable Oriented-Derivative Network for Polyp Segmentation

FeDNet: Feature Decoupled Network for Polyp Segmentation from Endoscopy Images

TransNetR: Transformer-based Residual Network for Polyp Segmentation with Multi-Center Out-of-Distribution Testing

Polyp Segmentation in Colonoscopy Images using U-Net-MobileNetV2

DPE-Net: Dual-Parallel Encoder Based Network for Semantic Segmentation of Polyps

PDCA-Net: Parallel dual-channel attention network for polyp segmentation

Automatic Polyp Segmentation with Multiple Kernel Dilated Convolution Network

CRPU-NET: a deep learning model based semantic segmentation for the detection of colorectal polyp in lower gastrointestinal tract