Transfer learning for auto‐segmentation of 17 organs‐at‐risk in the head and neck: Bridging the gap between institutional and public datasets

Brett Clark,Nicholas Hardcastle,Leigh A. Johnston,James Korte
DOI: https://doi.org/10.1002/mp.16997
IF: 4.506
2024-02-22
Medical Physics
Abstract:Background Auto‐segmentation of organs‐at‐risk (OARs) in the head and neck (HN) on computed tomography (CT) images is a time‐consuming component of the radiation therapy pipeline that suffers from inter‐observer variability. Deep learning (DL) has shown state‐of‐the‐art results in CT auto‐segmentation, with larger and more diverse datasets showing better segmentation performance. Institutional CT auto‐segmentation datasets have been small historically (n 1000 aggregated) have become available through online repositories such as The Cancer Imaging Archive. Transfer learning is a technique applied when training samples are scarce, but a large dataset from a closely related domain is available. Purpose The purpose of this study was to investigate whether a large public dataset could be used in place of an institutional dataset (n > 500), or to augment performance via transfer learning, when building HN OAR auto‐segmentation models for institutional use. Methods Auto‐segmentation models were trained on a large public dataset (public models) and a smaller institutional dataset (institutional models). The public models were fine‐tuned on the institutional dataset using transfer learning (transfer models). We assessed both public model generalizability and transfer model performance by comparison with institutional models. Additionally, the effect of institutional dataset size on both transfer and institutional models was investigated. All DL models used a high‐resolution, two‐stage architecture based on the popular 3D U‐Net. Model performance was evaluated using five geometric measures: the dice similarity coefficient (DSC), surface DSC, 95th percentile Hausdorff distance, mean surface distance (MSD), and added path length. Results For a small subset of OARs (left/right optic nerve, spinal cord, left submandibular), the public models performed significantly better (p
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?