SF-SegFormer: Stepped-Fusion Segmentation Transformer for Brain Tissue Image Via Inter-Group Correlation and Enhanced Multi-layer Perceptron

Jinjing Zhang,Lijun Zhao,Jianchao Zeng,Pinle Qin
DOI: https://doi.org/10.1007/978-3-031-12053-4_38
2022-01-01
Abstract:Many brain tissue segmentation methods generally utilize one-level fusion to explore complementary discrepancies among different modalities. However, this one-level fusion manner cannot fully explore potential characteristics of multi-modality images. To this end, we propose a multi-level fusion segmentation transformer framework (dubbed SF-SeFormer) for brain tissue segmentation. Specifically, the proposed SF-SegFormer consists of three parts: Double Paired-modality Encoding (DPE) network, Cross Feature Decoding (CFD) network and Semantical Double Boundary Generation (SDBG) branch. Firstly, our DPE network is introduced to extract features from two pairs of dual-modality for the first-level fusion. Secondly, we design CFD network for the second-level and the third-level fusion by using cross-feature updating block and Cross Feature Fusion (CFF) block. Thirdly, we propose multi-stage channel aggregation-based multi-layer perceptron to enrich channel-aggregation diversity for efficient feature representation. Besides, semantical double boundaries can help to distinguish brain tissues, so we design SDBG branch to predict boundary of each target region, which can regularize multi-resolution CFF features. A large number of experiments have shown that proposed method outperforms many state-of-the-art segmentation methods, when evaluating on BrainWeb dataset.
What problem does this paper attempt to address?