Hybrid-CTUNet: a double complementation approach for 3D medical image segmentation

Dong Wang, Kun Shang, Dong Liang, Yanjie Zhu
DOI: https://doi.org/10.1007/s13042-024-02469-w
2024-12-11
International Journal of Machine Learning and Cybernetics
Abstract: Medical image segmentation is a fundamental problem in medical image computing, with wide application in clinical domains such as medical diagnosis and robotic surgery. In this work, we investigate the distinct spatial characteristics of CNNs and Transformers in their representations of local and global features, and analyze how their network structures differ in preserving spatial position. These analyses provide a comprehensive explanation of the complementarity between CNNs and Transformers. To promote effective complementarity, we propose two novel architectures, CUNet and TUNet, each of which preserves its spatial characteristics throughout the encoder and decoder of the overall U-Net. For feature complementation, we combine CUNet and TUNet as parallel branches in a network named CTUNet, which enhances the long-range dependencies of global information in both deep and shallow locality. Moreover, we design binary cross-weights for element-wise addition to achieve a more prominent fusion of features with diverse spatial characteristics. For further mask complementation, we construct Hybrid-CTUNet by integrating the jointly trained CTUNet and the independently trained TUNet. Extensive empirical analysis on medical datasets confirms the superiority of our proposed method over state-of-the-art models. Code for reproducing the results is available at https://github.com/shouwangzhe134/Hybrid-CTUNet.git.
computer science, artificial intelligence