Progressive Group Convolution Fusion Network for Colon Polyp Segmentation

Zexuan Ji,Hao Qian,Xiao Ma
DOI: https://doi.org/10.1016/j.bspc.2024.106586
IF: 5.1
2024-01-01
Biomedical Signal Processing and Control
Abstract:In the field of medical imaging, the automatic detection and segmentation of colon polyps is particularly crucial for the early diagnosis of colorectal cancer. However, existing methods often face limitations when processing polyp images, especially under low-contrast and blurred boundary conditions, which hinder the recognition of complex features and thus affect the accuracy and efficiency of diagnosis. The challenge is further compounded by a lack of flexibility and precision in differentiating polyps of various sizes and shapes. To address these challenges, this study presents an advanced segmentation method that integrates a Pyramid Vision Transformer (PVT) encoder with a Convolutional Neural Network (CNN) decoder. The encoder, which utilizes the multi-level transformer modules of the PVT, effectively captures the intricate details and contextual information of the image, enabling precise extraction of complex features within polyp images. The decoder incorporates a Progressive Grouped Convolutional Fusion (PGCF) module that extracts multi-scale features through dilated convolutional kernels with different dilation rates. Coupled with attention mechanisms and differential subtraction strategies, our method not only enhances the feature fusion capability but also significantly improves the delineation of polyp boundaries. By integrating the PGCF module, differential operations, and multi-scale fusion strategies, our approach overcomes the limitations of existing colon polyp segmentation techniques. Experimental results on a large-scale annotated colon polyp image dataset show that our method demonstrates excellent performance and robustness in localizing and segmenting polyps of diverse sizes, shapes, and textures. The source codes are available at: https://github.com/peanutHao/PGCF.
What problem does this paper attempt to address?