ECSFF: Exploring Efficient Cross-Scale Feature Fusion for Medical Image Segmentation.

Yan Ke, Shui Yu, Zhonglai Wang, Yun Li
DOI: https://doi.org/10.1109/icac57885.2023.10275282
2023-01-01
Abstract: In medical image segmentation, the U-Net architecture has been highly successful and has become the industry standard. However, existing methods fuse high-dimensional features directly with low-dimensional ones without accounting for the differences between them, which makes the models hard to fit in small-data scenarios and limits performance. This paper proposes a transformer-based method for feature fusion. Specifically, anchor attention and self-attention are used to extract features: anchor attention fuses the high-dimensional and low-dimensional features, while self-attention extracts general features. Fusing the two kinds of features in this way bridges the semantic gap between them. In addition, the fusion model draws on contrastive learning to guide how the skip-connection and upsampling outputs are fused. In particular, the cross-scale fusion method uses the embedded downsampled feature as a negative sample, and uses the skip-connection output feature and the upsampled recovery feature as positive samples. The contrast then serves as a regularization constraint on the direction of the fused feature, thereby reducing model complexity. Experimental results show that this cross-scale fusion method produces more accurate segmentation, improving the Dice index by 2%-3% on the ISIC2018 and SegPC2021 datasets.
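The sample roles described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no loss formula, so the InfoNCE-style form, the cosine similarity, the temperature value, and the names `cosine`, `contrastive_regularizer`, and `dice` are all assumptions made for illustration. Features are plain Python lists standing in for flattened feature embeddings.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def contrastive_regularizer(fused, positives, negative, temperature=0.1):
    """Assumed InfoNCE-style term (hypothetical, not from the paper).

    fused     -- fused feature whose direction is being constrained
    positives -- e.g. skip-connection output and upsampled feature
    negative  -- e.g. embedded downsampled feature
    """
    neg_term = math.exp(cosine(fused, negative) / temperature)
    losses = []
    for pos in positives:
        pos_term = math.exp(cosine(fused, pos) / temperature)
        # Small when fused is close to the positive and far from the negative.
        losses.append(-math.log(pos_term / (pos_term + neg_term)))
    return sum(losses) / len(losses)


def dice(pred, target):
    """Dice index for two binary masks given as flat lists of 0/1."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


if __name__ == "__main__":
    skip_feature = [1.0, 0.0]
    upsampled_feature = [0.9, 0.1]
    downsampled_feature = [0.0, 1.0]
    aligned = contrastive_regularizer(
        [1.0, 0.0], [skip_feature, upsampled_feature], downsampled_feature
    )
    misaligned = contrastive_regularizer(
        [0.0, 1.0], [skip_feature, upsampled_feature], downsampled_feature
    )
    print(aligned < misaligned)
    print(dice([1, 1, 0, 0], [1, 0, 0, 0]))
```

Under these assumptions, a fused feature pointing toward the positive samples yields a smaller regularization term than one pointing toward the negative sample, which is the directional constraint the abstract describes. The `dice` helper shows the standard definition of the reported metric, 2|A∩B| / (|A| + |B|).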