Adenocarcinoma Segmentation Using Pre-trained Swin-UNet with Parallel Cross-Attention for Multi-Domain Imaging

Abdul Qayyum,Moona Mazher Imran Razzak,Steven A Niederer
2024-09-24
Abstract:Computer aided pathological analysis has been the gold standard for tumor diagnosis, however domain shift is a significant problem in histopathology. It may be caused by variability in anatomical structures, tissue preparation, and imaging processes challenges the robustness of segmentation models. In this work, we present a framework consist of pre-trained encoder with a Swin-UNet architecture enhanced by a parallel cross-attention module to tackle the problem of adenocarcinoma segmentation across different organs and scanners, considering both morphological changes and scanner-induced domain variations. Experiment conducted on Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation challenge dataset showed that our framework achieved segmentation scores of 0.7469 for the cross-organ track and 0.7597 for the cross-scanner track on the final challenge test sets, and effectively navigates diverse imaging conditions and improves segmentation accuracy across varying domains.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the issue of domain adaptation in adenocarcinoma segmentation. Specifically, the research team proposes a framework based on a pre-trained Swin-UNet architecture combined with a parallel cross-attention module to tackle the challenges of adenocarcinoma segmentation across different organs and scanning devices. This framework takes into account morphological variations and domain differences caused by scanning devices. Experiments on the Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation (COSAS-2024) challenge dataset demonstrate that the framework achieves segmentation scores of 0.7469 and 0.7597 for cross-organ and cross-scanner tasks, respectively, effectively improving segmentation accuracy and robustness under different imaging conditions.