ITUnet: Integration Of Transformers And Unet For Organs-At-Risk Segmentation

Hongyu Kan,Jun Shi,Minfan Zhao,Zhaohui Wang,Wenting Han,Hong An,Zhaoyang Wang,Shuo Wang
DOI: https://doi.org/10.1109/EMBC48229.2022.9871945
Abstract:Recently, convolutional neural network(CNN) has achieved great success in medical image segmentation. However, due to the limitation of convolutional receptive field, the pure convolutional neural network is difficult to further improve its performance. Given the outstanding ability of transformers in extracting the long-range dependency, some works have successfully applied it to computer vision and achieved better results than CNN in some tasks. Based on transformers could remedy the shortage of CNN, in this paper, we propose ITUnet, a segmentation network using CNN and transformers as features extractor. The combination of CNN and transformers enables the network to learn both short- and long-range dependency of features, which is beneficial to segmentation tasks. We evaluate our method on a head-and-neck CT dataset which has 18 kinds of organs to be segmented. The experimental results demonstrate that our proposed method shows better accuracy and robustness, the proposed methods achieve the Dice score of 77.72 and the 95% Hausdorff Distance of 2.31, outperforming the existing methods.
What problem does this paper attempt to address?