Abstract:Accurate and reliable segmentation of colorectal polyps is important for the diagnosis and treatment of colorectal cancer. Most of the existing polyp segmentation methods innovatively combine CNN with Transformer. Due to the single combination approach, there are limitations in establishing connections between local feature information and utilizing global contextual information captured by Transformer. Still not a better solution to the problems in polyp segmentation. In this paper, we propose a Dual Branch Multiscale Feature Fusion Network for Polyp Segmentation, abbreviated as DBMF, for polyp segmentation to achieve accurate segmentation of polyps. DBMF uses CNN and Transformer in parallel to extract multi-scale local information and global contextual information respectively, with different regions and levels of information to make the network more accurate in identifying polyps and their surrounding tissues. Feature Super Decoder (FSD) fuses multi-level local features and global contextual information in dual branches to fully exploit the potential of combining CNN and Transformer to improve the network's ability to parse complex scenes and the detection rate of tiny polyps. The FSD generates an initial segmentation map to guide the second parallel decoder (SPD) to refine the segmentation boundary layer by layer. SPD consists of a multi-scale feature aggregation module (MFA) and parallel polarized self-attention (PSA) and reverse attention fusion modules (RAF). MFA aggregates multi-level local feature information extracted by CNN Brach to find consensus regions between multiple scales and improve the network's ability to identify polyp regions. PSA uses dual attention to enhance the fine-grained nature of segmented regions and reduce the redundancy introduced by MFA and interference information. RAF mines boundary cues and establishes relationships between regions and boundary cues. The three RAFs guide the network to explore lost targets and boundaries in a bottom-up manner. We used the CVC-ClinicDB, Kvasir, CVC-300, CVC-ColonDB, and ETIS datasets to conduct comparison experiments and ablation experiments between DBMF and mainstream polyp segmentation networks. The results showed that DBMF outperformed the current mainstream networks on five benchmark datasets.

Meta-Polyp: a baseline for efficient Polyp segmentation

MetaFormer and CNN Hybrid Model for Polyp Image Segmentation

M^2UNet: MetaFormer Multi-scale Upsampling Network for Polyp Segmentation

Probabilistic Modeling Ensemble Vision Transformer Improves Complex Polyp Segmentation

Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation

Multi‐scale nested UNet with transformer for colorectal polyp segmentation

PolyPooling: An accurate polyp segmentation from colonoscopy images

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

RetSeg: Retention-based Colorectal Polyps Segmentation Network

Adaptation of Distinct Semantics for Uncertain Areas in Polyp Segmentation

Polyp-Mamba: A Hybrid Multi-Frequency Perception Gated Selection Network for polyp segmentation

SegT: A Novel Separated Edge-guidance Transformer Network for Polyp Segmentation

CTNet: Contrastive Transformer Network for Polyp Segmentation

CoAM-Net: Coordinate Asymmetric Multi-Scale Fusion Strategy for Polyp Segmentation

BCL-Former: Localized Transformer Fusion with Balanced Constraint for polyp image segmentation

BetterNet: An Efficient CNN Architecture with Residual Learning and Attention for Precision Polyp Segmentation

Know your orientation: A viewpoint-aware framework for polyp segmentation

TransNetR: Transformer-based Residual Network for Polyp Segmentation with Multi-Center Out-of-Distribution Testing

DBMF: Dual Branch Multiscale Feature Fusion Network for polyp segmentation

ECTransNet: An Automatic Polyp Segmentation Network Based on Multi-scale Edge Complementary

PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion