UMF-Net: A UNet-based Multi-Branch Feature Fusion Network for Colon Polyp Segmentation

Yulong Wan, Dongming Zhou, Changcheng Wang
DOI: https://doi.org/10.1016/j.bspc.2024.106851
2025-01-01
Abstract: The early diagnosis of colorectal cancer relies heavily on colonoscopy, but polyps are often missed during clinical examination because of various confounding factors. Polyp segmentation models fall into two architectural families: those based on convolutional neural networks (CNNs), which specialize in local modeling, and those based on Transformers, which excel at capturing global context. Existing methods that combine the two often do so naively, without addressing either the large parameter count of Transformer architectures or the fixed-weight limitation of convolutional networks. In this study, we propose a Transformer-to-CNNs block that combines advanced components from Transformers with deformable convolutions (DCN) and replaces the Transformer block. DCN learns offsets to capture long-range dependencies, applies adaptive weights, and samples only a fixed number of pixels, thereby effectively reducing the parameter count. Based on this approach, we introduce a novel architecture called UMF-Net (UNet-based Multi-Branch Feature Fusion Network), which adopts a multi-branch design and a unique feature fusion strategy to aggregate features from these branches. Additionally, we incorporate a Residual Coarse-to-Fine block to further enhance the model's learning capability. We evaluate the proposed method on four widely used benchmark datasets using the metrics mDice, mIoU, F-beta(omega), S-alpha, mE(xi), and MAE. The experimental results demonstrate that UMF-Net outperforms other state-of-the-art methods and effectively improves colon polyp segmentation accuracy.
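Three of the metrics named in the abstract (mDice, mIoU, and MAE) have simple standard definitions, and a minimal sketch may help make them concrete. The abstract does not specify the paper's evaluation code, so everything below is an assumption for illustration: the function names `dice_iou_mae` and `mean_scores`, the 0.5 binarization threshold, the small `eps` smoothing term, and the choice to compute MAE on the unthresholded prediction map. Masks are represented as flat Python lists to keep the example dependency-free.

```python
def dice_iou_mae(pred, target, threshold=0.5, eps=1e-8):
    """Score one predicted mask against its ground truth.

    pred:   flat list of predicted foreground probabilities in [0, 1]
    target: flat list of ground-truth labels (0 or 1), same length
    The threshold and eps values are illustrative assumptions.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    # Binarize the prediction for the overlap-based metrics.
    binary = [1 if p >= threshold else 0 for p in pred]

    intersection = sum(b * t for b, t in zip(binary, target))
    pred_sum = sum(binary)
    target_sum = sum(target)
    union = pred_sum + target_sum - intersection

    # Dice = 2|A n B| / (|A| + |B|); IoU = |A n B| / |A u B|.
    dice = (2 * intersection + eps) / (pred_sum + target_sum + eps)
    iou = (intersection + eps) / (union + eps)

    # MAE is taken here on the raw probabilities (an assumption).
    mae = sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)
    return dice, iou, mae


def mean_scores(pairs):
    """Average (dice, iou, mae) over (pred, target) pairs: mDice, mIoU, MAE."""
    scores = [dice_iou_mae(p, t) for p, t in pairs]
    n = len(scores)
    return tuple(sum(s[i] for s in scores) / n for i in range(3))
```

Calling `mean_scores` on a list of `(pred, target)` pairs for a whole test set yields dataset-level values in the same spirit as the reported mDice, mIoU, and MAE, though the exact protocol used in the paper may differ.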