Flexible Colon Polyp Detection: A Dual Mode Approach for Detection and Segmentation of Colon Polyps with Optional Inpainting for Specular Highlight Mitigation

Geetha Sushama, Gopakumar Chandrasekhara Menon
DOI: https://doi.org/10.1007/s42979-024-02932-z
2024-06-12
SN Computer Science
Abstract: Colorectal cancer (CRC) is one of the top ten cancers in terms of both incidence and mortality rates in India, and it ranks among the top three cancers in the world. Early detection and removal of malignant colorectal polyp regions, which are precursors of this cancer, can substantially reduce deaths due to colorectal cancer. Machine learning, particularly deep learning based on convolutional neural networks (CNNs), is actively employed in the medical domain for the detection and segmentation of malignant regions, but the generalizability of such models remains an open challenge. In this paper, we address the challenge of improving polyp detection, localization, and segmentation in endoscopic images by mitigating selection bias and by allowing medical professionals to remove specular effects on demand. Our proposed method incorporates both polyp and non-polyp images for model development and selectively eliminates specular effects to enhance image quality. Additionally, we introduce a deep CNN for region suggestion, enabling simultaneous detection, localization, and segmentation of polyp regions. The model is trained on the multicenter polyp detection and segmentation dataset PolypGen. To assess generalizability, we evaluate the model on recently published datasets such as the Gastrointestinal atlas-Colon Polyp dataset and the Gastrolab-Polyp test dataset, as well as the widely used ETIS-Larib dataset, after initially training on different datasets. The evaluation results demonstrate that our model, trained on a multicenter polyp detection and segmentation dataset containing both positive and negative samples, produces promising results. The model's precision, recall, F1, and F2 scores consistently exceed 75% across all evaluated datasets.
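The abstract reports precision, recall, F1, and F2 scores. As a reminder of how these relate, the sketch below computes them from true-positive, false-positive, and false-negative counts using the standard F-beta definition. The function name `f_beta` and the example counts are illustrative assumptions, not taken from the paper, and the paper's own evaluation protocol (for example, how a detection is matched to a ground-truth polyp) is not specified here.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """Standard F-beta score from detection counts (hypothetical helper).

    beta = 1 gives F1; beta = 2 gives F2, which weights recall more
    heavily than precision.
    """
    # Guard against empty denominators when there are no predictions
    # or no ground-truth positives.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Illustrative counts only (not results from the paper).
tp, fp, fn = 80, 20, 10
f1 = f_beta(tp, fp, fn, beta=1.0)
f2 = f_beta(tp, fp, fn, beta=2.0)
print(f"F1={f1:.3f} F2={f2:.3f}")
```

Because F2 emphasizes recall, it penalizes missed polyps more strongly than false alarms, which is a common reason to report it alongside F1 in screening settings.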