Integrating Dual Attention with TransUNet for Esophagus Image Segmentation

Zhehao Yu,Shaochen Jiang,Liejun Wang
DOI: https://doi.org/10.1117/12.3033544
2024-01-01
Abstract:Esophageal cancer is a common digestive tract tumor with a high mortality rate. Currently, the identification of esophageal carcinoma predominantly hinges on thoracic computed tomography scanning. Because the area of the esophagus on CT images is small and belongs to the small target region, the traditional image segmentation network is difficult to accurately segment the tumor area of esophageal cancer. Although the traditional U-Net architecture and Transformer integrated variants perform well in medical segmentation tasks, they lack image localization and channel feature extraction capabilities. To address the aforementioned issues, this manuscript proposes a novel medical image segmentation model, christened DAA-TransUNet. We propose a new module called Dual Attention Area (DAA) module,it can extract image position and channel features. And we combine the DAA module and the Efficient Channel Attention (ECA) module into a new module, which we call the Efficient Channel Dual Attention (ECDA) module. DAA-TransUNet integrates Transformer and ECDA modules into a traditional U-shaped architecture, enabling it to extract not only global and local information, but also image position and channel features. We have verified the superior performance of our network by performing numerous experiments on our collected esophageal dataset. We used DAA-TransUNet to perform experiments on our collected dataset, and the average DSC(%) reached 75.26% and the average HD95(mm) reached 3.54.
What problem does this paper attempt to address?