DBT-UNETR: Double Branch Transformer with Cross Fusion for 3D Medical Image Segmentation

Haojie Tao,Keming Mao,Yuhai Zhao
DOI: https://doi.org/10.1109/bibm55620.2022.9995409
2022-01-01
Abstract:Medical image segmentation is significant for diagnosis and prognosis. Inspired by the success of Transformer for Natural Language Processing, this paper presents DBT-UNETR, a novel double branch Transformer architecture with cross fusion for multi-scale feature representation, which enhances the model performance of 3D medical image segmentation. The DBT-UNETR consists of a single scale Transformer and a multiple scale Transformer to capture informative feature. These two branch features are cross fused and then they are combined together. Finally, these feature representations are sent to decoders via skip connection for final segmentation. Moreover, in order to further improve the model performance, an Early Fusion module is designed. Comprehensive experiments are conducted on public available dataset, Multi-Atlas Labeling Beyond The Cranial Vault (BTCV), and it demonstrates the proposed model outperforms the comparative baseline methods.
What problem does this paper attempt to address?