Polyp segmentation based on implicit edge-guided cross-layer fusion networks

Junqing Liu, Weiwei Zhang, Yong Liu, Qinghe Zhang
DOI: https://doi.org/10.1038/s41598-024-62331-5
IF: 4.6
2024-05-24
Scientific Reports
Abstract: Polyps are abnormal tissue clumps growing primarily on the inner linings of the gastrointestinal tract. While such clumps are generally harmless, they can potentially evolve into pathological tumors, and thus require long-term observation and monitoring. Polyp segmentation in gastrointestinal endoscopy images is an important stage for polyp monitoring and subsequent treatment. However, this segmentation task faces multiple challenges: the low contrast of the polyp boundaries, the varied polyp appearance, and the co-occurrence of multiple polyps. So, in this paper, an implicit edge-guided cross-layer fusion network (IECFNet) is proposed for polyp segmentation. The codec pair is used to generate an initial saliency map, the implicit edge-enhanced context attention module aggregates the feature graph output from the encoding and decoding to generate the rough prediction, and the multi-scale feature reasoning module is used to generate final predictions. Polyp segmentation experiments have been conducted on five popular polyp image datasets (Kvasir, CVC-ClinicDB, ETIS, CVC-ColonDB, and CVC-300), and the experimental results show that the proposed method significantly outperforms a conventional method, especially with an accuracy margin of 7.9% on the ETIS dataset.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper addresses three challenges in gastrointestinal polyp segmentation: low-contrast polyp boundaries, varied polyp appearance, and the co-occurrence of multiple polyps. Together these make it difficult to segment polyps accurately in endoscopic images. To meet these challenges, the paper proposes an Implicit Edge-Guided Cross-Layer Fusion Network (IECFNet), which works in three stages:

1. **Initial Saliency Map Generation**: An encoder-decoder pair generates an initial saliency map.
2. **Implicit Edge-Enhanced Context Attention Module**: The feature maps output by the encoding and decoding stages are aggregated to generate a rough prediction.
3. **Multi-Scale Feature Reasoning Module**: The rough prediction is refined into the final prediction.

The paper reports experiments on five popular polyp image datasets (Kvasir, CVC-ClinicDB, ETIS, CVC-ColonDB, and CVC-300). The results show that the proposed method significantly outperforms a conventional method, with an accuracy margin of 7.9% on the ETIS dataset.

### Key Contributions

1. **New Deep Network Model**: The model improves segmentation by combining global context information, cross-layer feature fusion, low-level feature refinement, and multi-scale feature reasoning.
2. **Expanded Spatial Receptive Field**: The Receptive Field Coordinate Attention Encoder (RFCA-e) is proposed to expand the spatial receptive field of the backbone network.
3. **Implicit Edge-Enhanced Context Attention Module**: Built on the multi-head self-attention mechanism and edge information, it compensates for the lack of explicit shape-boundary information.
4. **Multi-Scale Feature Reasoning Module**: It detects and fuses image features at different scales to produce the final segmentation result.

### Method Overview

The IECFNet framework proposed in the paper includes the following main modules:

- **Receptive Field Coordinate Attention Encoder and Decoder Pair** (RFCA-e and RFCA-d): Expands the receptive field of the network and reduces computational cost.
- **Implicit Edge-Enhanced Context Attention Module** (IECA): Extracts rich semantic features using the multi-head self-attention mechanism and edge information.
- **Multi-Scale Feature Reasoning Module** (MSFR): Uses low-level features and multi-scale strategies to generate the final segmentation result.

Working together, these modules allow IECFNet to segment complex polyp images more accurately, which supports polyp monitoring and subsequent treatment.
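The IECA module is described as building on multi-head self-attention. As a rough illustration of that generic mechanism only (not the paper's IECA design, which also uses edge information), the sketch below implements scaled dot-product self-attention split across heads in plain Python. It assumes identity query/key/value projections where a real network would learn them, and the names `attention_head`, `multi_head_self_attention`, and `features` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_head(tokens):
    """Scaled dot-product self-attention for a single head.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections); a trained model would use learned matrices.
    Returns (outputs, weights), where weights[i][j] is how much
    position i attends to position j.
    """
    dim = len(tokens[0])
    scale = math.sqrt(dim)
    outputs, weights = [], []
    for query in tokens:
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in tokens]
        row = softmax(scores)
        weights.append(row)
        # Weighted sum of all value vectors for this query position.
        outputs.append([sum(w * value[i] for w, value in zip(row, tokens))
                        for i in range(dim)])
    return outputs, weights


def multi_head_self_attention(tokens, num_heads):
    """Split each token into num_heads chunks, attend per chunk, concatenate."""
    dim = len(tokens[0])
    if dim % num_heads != 0:
        raise ValueError("feature dimension must be divisible by num_heads")
    head_dim = dim // num_heads
    merged = [[] for _ in tokens]
    for h in range(num_heads):
        chunk = [t[h * head_dim:(h + 1) * head_dim] for t in tokens]
        head_out, _ = attention_head(chunk)
        for position, vector in enumerate(head_out):
            merged[position].extend(vector)
    return merged


# Hypothetical input: three 4-dimensional feature vectors, standing in
# for flattened spatial positions of a feature map.
features = [[1.0, 0.0, 0.5, 0.5],
            [0.0, 1.0, 0.5, 0.5],
            [1.0, 1.0, 0.0, 1.0]]
attended = multi_head_self_attention(features, num_heads=2)
```

Each output vector is a weighted mix of all input positions, which is how attention lets every location draw on global context. How IECA injects edge cues into this mechanism is not specified in the summary above, so it is left out here.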