Abstract:In various countries worldwide, the incidence of colon cancer-related deaths has been on the rise in recent years. Early detection of symptoms and identification of intestinal polyps are crucial for improving the cure rate of colon cancer patients. Automated computer-aided diagnosis (CAD) has emerged as a solution to the low efficiency of traditional methods relying on manual diagnosis by physicians. Deep learning is the latest direction of CAD development and has shown promise for colonoscopic polyp segmentation. In this paper, we present a multi-level encoder-decoder architecture for polyp segmentation based on the Transformer architecture, termed NA-SegFormer. To improve the performance of existing Transformer-based segmentation algorithms for edge segmentation on colon polyps, we propose a patch merging module with a neighbor attention mechanism based on overlap patch merging. Since colon tract polyps vary greatly in size and different datasets have different sample sizes, we used a unified focal loss to solve the problem of category imbalance in colon tract polyp data. To assess the effectiveness of our proposed method, we utilized video capsule endoscopy and typical colonoscopy polyp datasets, as well as a dataset containing surgical equipment. On the datasets Kvasir-SEG, Kvasir-Instrument and KvasirCapsule-SEG, the Dice score of our proposed model reached 94.30%, 94.59% and 82.73%, with an accuracy of 98.26%, 99.02% and 81.84% respectively. The proposed method achieved inference speed with an Frame-per-second (FPS) of 125.01. The results demonstrated that our suggested model effectively segmented polyps better than several well-known and latest models. In addition, the proposed method has advantages in trade-off between inference speed and accuracy, and it will be of great significance to real-time colonoscopic polyp segmentation. The code is available at https://github.com/promisedong/NAFormer.

PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion

Probabilistic Modeling Ensemble Vision Transformer Improves Complex Polyp Segmentation

Polyp Segmentation With the FCB-SwinV2 Transformer

FCB-SwinV2 Transformer for Polyp Segmentation

FCN-Transformer Feature Fusion for Polyp Segmentation

SegT: A Novel Separated Edge-guidance Transformer Network for Polyp Segmentation

Polyp Segmentation Using a Hybrid Vision Transformer and a Hybrid Loss Function

ECTransNet: An Automatic Polyp Segmentation Network Based on Multi-scale Edge Complementary

Automatic Polyp Segmentation in Colonoscopy Images Using a Modified Deep Convolutional Encoder-Decoder Architecture

RetSeg: Retention-based Colorectal Polyps Segmentation Network

NA-segformer: A multi-level transformer model based on neighborhood attention for colonoscopic polyp segmentation

Transformer-Enhanced Iterative Feedback Mechanism for Polyp Segmentation

Colorectal Polyp Segmentation Based on Group Convolution and Transformer

Multi‐scale nested UNet with transformer for colorectal polyp segmentation

Utilizing adaptive deformable convolution and position embedding for colon polyp segmentation with a visual transformer

Dual encoder–decoder-based deep polyp segmentation network for colonoscopy images

TMPSformer: An Efficient Hybrid Transformer-MLP Network for Polyp Segmentation

PolyPooling: An accurate polyp segmentation from colonoscopy images

MugenNet: A Novel Combined Convolution Neural Network and Transformer Network with its Application for Colonic Polyp Image Segmentation

Maharishi international university: "science of creative intelligence%".