Abstract:Polyps are abnormal tissue clumps growing primarily on the inner linings of the gastrointestinal tract. While such clumps are generally harmless, they can potentially evolve into pathological tumors, and thus require long-term observation and monitoring. Polyp segmentation in gastrointestinal endoscopy images is an important stage for polyp monitoring and subsequent treatment. However, this segmentation task faces multiple challenges: the low contrast of the polyp boundaries, the varied polyp appearance, and the co-occurrence of multiple polyps. So, in this paper, an implicit edge-guided cross-layer fusion network (IECFNet) is proposed for polyp segmentation. The codec pair is used to generate an initial saliency map, the implicit edge-enhanced context attention module aggregates the feature graph output from the encoding and decoding to generate the rough prediction, and the multi-scale feature reasoning module is used to generate final predictions. Polyp segmentation experiments have been conducted on five popular polyp image datasets (Kvasir, CVC-ClinicDB, ETIS, CVC-ColonDB, and CVC-300), and the experimental results show that the proposed method significantly outperforms a conventional method, especially with an accuracy margin of 7.9% on the ETIS dataset.

What problem does this paper attempt to address?

This paper aims to address the challenges in gastrointestinal polyp segmentation, especially the problems of low - contrast polyp boundaries, diverse polyp appearances, and the co - existence of multiple polyps. These problems make it very difficult to accurately segment polyps in endoscopic images. To meet these challenges, the paper proposes an Implicit Edge - Guided Cross - Layer Fusion Network (IECFNet) method for polyp segmentation. Specifically, this method addresses the above problems through the following aspects: 1. **Initial Saliency Map Generation**: Use an encoder - decoder pair to generate an initial saliency map. 2. **Implicit Edge - Enhanced Contextual Attention Module**: Aggregate the feature map outputs of the encoding and decoding stages to generate a rough prediction. 3. **Multi - Scale Feature Inference Module**: Generate the final prediction result. The paper conducted experiments on five popular polyp image datasets (Kvasir, CVC - ClinicDB, ETIS, CVC - ColonDB, and CVC - 300). The experimental results show that the proposed method significantly outperforms traditional methods in multiple metrics, especially with a 7.9% improvement in accuracy on the ETIS dataset. ### Key Contributions 1. **New Deep Network Model**: This model improves the segmentation effect by effectively using global context information, cross - layer feature fusion, low - level feature refinement, and multi - scale feature inference. 2. **Expand Spatial Receptive Field**: Propose the Receptive Field Coordinate Attention Encoder (RFCA - e) to expand the spatial receptive field of the backbone network. 3. **Implicit Edge - Enhanced Contextual Attention Module**: Designed based on the multi - head self - attention mechanism and edge information, it compensates for the lack of explicit shape - boundary information. 4. **Multi - Scale Feature Inference Module**: Through multi - scale feature inference, detect and fuse image features at different scales, and finally obtain the segmentation result. ### Method Overview The IECFNet framework proposed in the paper includes the following main modules: - **Receptive Field Coordinate Attention Encoder and Decoder Pairs** (RFCA - e and RFCA - d): Used to expand the receptive field of the network and reduce computational costs. - **Implicit Edge - Enhanced Contextual Attention Module** (IECA): Through the multi - head self - attention mechanism and edge information, extract rich semantic features. - **Multi - Scale Feature Inference Module** (MSFR): Utilize low - level features and multi - scale strategies to generate the final segmentation result. Through the collaborative work of these modules, IECFNet can achieve more accurate segmentation in complex polyp images, thus providing important support for polyp monitoring and subsequent treatment.

Polyp segmentation based on implicit edge-guided cross-layer fusion networks

ECTransNet: An Automatic Polyp Segmentation Network Based on Multi-scale Edge Complementary

Cross-Level Context Fusion Network for Polyp Segmentation in Colonoscopy Images

Probabilistic Modeling Ensemble Vision Transformer Improves Complex Polyp Segmentation

CIFG-Net: Cross-level information fusion and guidance network for Polyp Segmentation

An Edge-Enhanced Network for Polyp Segmentation

Edge-aware Feature Aggregation Network for Polyp Segmentation

Multi-Dimensional Fusion Reverse Attention Network for Polyp Segmentation

A Local Region Proposals Approach to Instance Segmentation for Intestinal Polyp Detection

Iterative feedback-based models for image and video polyp segmentation

HIGF-Net: Hierarchical Information-Guided Fusion Network for Polyp Segmentation Based on Transformer and Convolution Feature Learning

A Coarse-to-fine Segmentation Frame for Polyp Segmentation Via Deep and Classification Features

Efficient Polyp Segmentation Via Integrity Learning

Improving Polyp Segmentation with Boundary-Assisted Guidance and Cross-Scale Interaction Fusion Transformer Network

CAFE-Net: Cross-Attention and Feature Exploration Network for polyp segmentation

Unveiling camouflaged and partially occluded colorectal polyps: Introducing CPSNet for accurate colon polyp segmentation

Learnable Oriented-Derivative Network for Polyp Segmentation

Polyp-Mamba: A Hybrid Multi-Frequency Perception Gated Selection Network for polyp segmentation

MCE-Net: polyp segmentation with multiple branch series-parallel attention and channel interaction via edge distribution guidance

MCSF-Net: a multi-scale channel spatial fusion network for real-time polyp segmentation

EPPS: Advanced Polyp Segmentation via Edge Information Injection and Selective Feature Decoupling