ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp Detection

Yuncheng Jiang,Zixun Zhang,Yiwen Hu,Guanbin Li,Xiang Wan,Song Wu,Shuguang Cui,Silin Huang,Zhen Li
2024-01-10
Abstract:Accurate polyp detection is critical for early colorectal cancer diagnosis. Although remarkable progress has been achieved in recent years, the complex colon environment and concealed polyps with unclear boundaries still pose severe challenges in this area. Existing methods either involve computationally expensive context aggregation or lack prior modeling of polyps, resulting in poor performance in challenging cases. In this paper, we propose the Enhanced CenterNet with Contrastive Learning (ECC-PolypDet), a two-stage training \& end-to-end inference framework that leverages images and bounding box annotations to train a general model and fine-tune it based on the inference score to obtain a final robust model. Specifically, we conduct Box-assisted Contrastive Learning (BCL) during training to minimize the intra-class difference and maximize the inter-class difference between foreground polyps and backgrounds, enabling our model to capture concealed polyps. Moreover, to enhance the recognition of small polyps, we design the Semantic Flow-guided Feature Pyramid Network (SFFPN) to aggregate multi-scale features and the Heatmap Propagation (HP) module to boost the model's attention on polyp targets. In the fine-tuning stage, we introduce the IoU-guided Sample Re-weighting (ISR) mechanism to prioritize hard samples by adaptively adjusting the loss weight for each sample during fine-tuning. Extensive experiments on six large-scale colonoscopy datasets demonstrate the superiority of our model compared with previous state-of-the-art detectors.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the accuracy issue of polyp detection in the early diagnosis of colorectal cancer (CRC). Despite significant progress in polyp detection in recent years, the complex colon environment and the presence of hidden polyps with blurred boundaries still pose severe challenges to this task. Existing methods either have high computational costs requiring complex context aggregation or lack prior modeling of polyps, leading to poor performance in difficult situations. Specifically, this paper proposes Enhanced CenterNet with Contrastive Learning (ECC-PolypDet), a two-stage training and end-to-end inference framework. This framework uses images and bounding box annotations to train a general model and fine-tunes it based on inference scores to obtain a final robust model. The main contributions include: 1. **Hidden Polyp Recognition**: By using the Box-assisted Contrastive Learning (BCL) module, intra-class differences are minimized and inter-class differences are maximized during training, thereby better capturing hidden polyps. 2. **Enhanced Small Polyp Recognition**: The Semantic Flow-guided Feature Pyramid Network (SFFPN) and Heatmap Propagation (HP) modules are designed for multi-scale feature fusion and enhanced small polyp recognition. 3. **Hard Sample Optimization**: The IoU-guided Sample Reweighting (ISR) mechanism is introduced, which adaptively adjusts the loss weight based on the inference IoU score of each sample during the second stage of training, thereby improving the model's robustness and generalization ability. Extensive experiments on 6 large-scale colonoscopy datasets demonstrate that the proposed method outperforms existing state-of-the-art methods.