Abstract: Colonoscopy is a reliable diagnostic method for detecting colorectal polyps early and preventing colorectal cancer. Current examination techniques suffer from high miss rates, leaving numerous polyps and irregularities undetected. Automated, real-time segmentation methods can help endoscopists delineate the shape and location of polyps in colonoscopy images, facilitating clinicians' timely diagnosis and intervention. The varied shapes and small sizes of polyps, and their close resemblance to surrounding tissue, make this task challenging. Furthermore, high-definition image quality and reliance on the operator make real-time, accurate endoscopic image segmentation even more challenging. Deep learning models for polyp segmentation, designed to capture diverse patterns, are becoming progressively more complex, and this complexity poses challenges for real-time medical operations. In clinical settings, automated methods require accurate, lightweight models with minimal latency that integrate seamlessly with endoscopic hardware. To address these challenges, this study proposes Enhanced NanoNet, a novel lightweight and more generalized model that improves on NanoNet, using NanoNetB for real-time and precise colonoscopy image segmentation. The proposed model improves the overall prediction scheme of NanoNetB by applying data augmentation, a Conditional Random Field (CRF), and Test-Time Augmentation (TTA). Six publicly available datasets are used for thorough evaluation, to assess generalizability, and to validate the improvements: Kvasir-SEG, Endotect Challenge 2020, Kvasir-instrument, CVC-ClinicDB, CVC-ColonDB, and CVC-300.
In extensive experiments on the Kvasir-SEG dataset, our model achieves an mIoU of 0.8188 and a Dice coefficient of 0.8060 with only 132,049 parameters and minimal computational resources. A thorough cross-dataset evaluation was performed to assess the generalization capability of the proposed Enhanced NanoNet model across several publicly available polyp datasets, with a view to real-world applications. The results show that CRF and TTA improve performance both within the same dataset and across diverse datasets, with a model size of just 132,049 parameters. The proposed method also shows improved results in detecting smaller and flat (sessile) polyps, which are significant contributors to the high miss rates.
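As an illustration of two ideas named in the abstract, the sketch below shows one common form of test-time augmentation (averaging predictions over flipped copies of the input) and how a Dice coefficient and an IoU can be computed for a binary mask. It is a minimal sketch under stated assumptions, not the authors' implementation: the choice of flips, the 0.5 threshold, and the names `predict_with_tta`, `binarize`, and `dice_and_iou` are all hypothetical, the CRF step is omitted, and the IoU here covers only the foreground class, which may differ from how the reported mIoU is averaged.

```python
from typing import Callable, List, Tuple

Grid = List[List[float]]  # 2-D grid (rows x columns) of pixel values or probabilities


def hflip(img: Grid) -> Grid:
    """Mirror the grid left to right."""
    return [list(reversed(row)) for row in img]


def vflip(img: Grid) -> Grid:
    """Mirror the grid top to bottom."""
    return [list(row) for row in reversed(img)]


def identity(img: Grid) -> Grid:
    return [list(row) for row in img]


def predict_with_tta(predict: Callable[[Grid], Grid], image: Grid) -> Grid:
    """Average the model's probability maps over flipped views of the input.

    `predict` is a stand-in for any segmentation model that maps an image
    to a per-pixel foreground probability map of the same shape.
    Flips are self-inverse, so the same function undoes each transform.
    """
    transforms = [identity, hflip, vflip]  # assumed augmentation set
    height, width = len(image), len(image[0])
    total = [[0.0] * width for _ in range(height)]
    for transform in transforms:
        # Predict on the transformed image, then map the result back.
        prob = transform(predict(transform(image)))
        for i in range(height):
            for j in range(width):
                total[i][j] += prob[i][j]
    return [[value / len(transforms) for value in row] for row in total]


def binarize(prob: Grid, threshold: float = 0.5) -> List[List[int]]:
    """Turn a probability map into a 0/1 mask (threshold is an assumption)."""
    return [[1 if value >= threshold else 0 for value in row] for row in prob]


def dice_and_iou(pred: List[List[int]], target: List[List[int]]) -> Tuple[float, float]:
    """Foreground Dice coefficient and IoU for two binary masks."""
    inter = sum(p & t for pr, tr in zip(pred, target) for p, t in zip(pr, tr))
    pred_sum = sum(sum(row) for row in pred)
    target_sum = sum(sum(row) for row in target)
    union = pred_sum + target_sum - inter
    # Two empty masks agree perfectly, so both scores are defined as 1.0.
    dice = 2 * inter / (pred_sum + target_sum) if (pred_sum + target_sum) else 1.0
    iou = inter / union if union else 1.0
    return dice, iou
```

In practice `predict` would wrap a trained network, and the averaged probability map could be passed to a CRF for refinement before thresholding.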

Enhanced accuracy with Segmentation of Colorectal Polyp using NanoNetB, and Conditional Random Field Test-Time Augmentation
