Abstract:To quantitatively and qualitatively evaluate a deep learning auto contouring model for prostate radiotherapy patients with pretreatment insertion of a hydrogel spacer (about water equivalent with no contrast) between prostate and rectum. The model employs convolutional neural networks (CNN) to learn features from input images that can be used to generate semantic segmentation. The study used 163 patients from three specialized GU radiation oncologists (referred to as A/B/C). The first 135 patients (A/B/C = 82/39/14) were used for training (125) and validation (10). The validation patients were randomly selected. The validated model was tested on 28 patients (A/B/C = 18/6/4) accrued during model development. There was no change of practice during the whole period. A simulation CT and MR were taken on the same day for each patient. In manual contouring, with MR fused to CT, spacer was contoured on T2 MR, prostate on CT with MR guidance, and other structures on CT only. The model was trained to auto contour prostate, proximal seminal vesicles (SV), bladder, rectum, penile bulb, femurs and spacer on CT without MR. Quantitatively, auto contours were evaluated against manual contours using the following metrics: sensitivity (% of voxels correctly drawn), false positive rate (FPR, % of voxels overdrawn), dice similarity coefficient (DSC), 95-percentile of Hausdorff distance (HD) and mean distances (dmean) between the two contours over all slices. The structures with high DSC were qualitatively evaluated by the original attending using a 1 (acceptable with minor editing), 2 (editable with efficiency gain over manual contouring) and 3 (rejected for no efficiency gain or gross error) scoring system. A gross error on rectum occurred for two patients (A/B = 1/1). These two points were excluded from quantitative analysis but counted as rejected in qualitative evaluation. On average, DSC was high for femurs (>0.95) and bladder (0.91), moderate for prostate (0.85) and rectum (0.81), but low for bulb (0.67), proximal SV (0.62) and spacer (0.52). For right femur/left femur/bladder/prostate/rectum, sensitivity = 0.93/0.92/0.88/0.86/0.81, FPR = 1.8%/1.5%/4.5%/15%/17%, 95% 95%-HD = 2.8/2.6/12.1/7.4/9.5 mm, and dmean = 0.9/1.0/2.6/2.5/2.4 mm. Qualitatively, femurs scored 1 in all cases. The average scores for bladder/prostate/rectum = 1.28/1.44/1.50, 1.83/2.17/1.67, 1.25/1.50/1.25 for physicians A, B, C, respectively, and 1.39/1.61/1.50 overall. Prostate and rectum both scored well below 2, despite their lower quantitative performance, as some errors caused by the inaccurate prediction of spacer without MR were deemed easily correctable by the physicians. The model produced clinically satisfactory results, both quantitatively and qualitatively, for femurs, bladder, prostate and rectum. The results for proximal SV and bulb were less ideal. The model drew the spacer in the correct location, but could not draw it accurately due to lack of contrast on CT.

Knowledge-based quality assurance of a comprehensive set of organ at risk contours for head and neck radiotherapy

Comprehensive and Clinically Accurate Head and Neck Cancer Organs-at-risk Delineation on a Multi-Institutional Study

Comprehensive and Clinically Accurate Head and Neck Organs at Risk Delineation Via Stratified Deep Learning: A Large-scale Multi-Institutional Study

Dosimetric Analysis of Radiation Treatment Plans Based on a Deep Learning Auto Contouring Model for Patients with Localized Prostate Cancer

Qualitative and Quantitative Analysis of a Deep Learning Auto Contouring Model for Radiotherapy in Localized Prostate Cancer.

Automated Clinical Target Volume Delineation Using Deep 3D Neural Networks in Radiation Therapy of Non-small Cell Lung Cancer

Quantitative and Qualitative Evaluation of a Deep Learning Auto Contouring Model for Prostate Cancer Patients with Hydrogel Spacer

Automated Clinical Target Volume Delineation for Non-Small Cell Lung Cancer Patients Using Deep 3D Networks

Quality assurance of organs-at-risk delineation in radiotherapy

Investigation on performance of multiple AI-based auto-contouring systems in organs at risks (OARs) delineation

Synthetic MRI-aided Head-and-Neck Organs-at-Risk Auto-Delineation for CBCT-guided Adaptive Radiotherapy

Dosimetric impact of contour editing on CT and MRI deep‐learning autosegmentation for brain OARs

A multi-modal vision-language pipeline strategy for contour quality assurance and adaptive optimization

Machine Learning-Based Quality Assurance for Automatic Segmentation of Head-and-Neck Organs-at-Risk in Radiotherapy

Performance analysis and knowledge-based quality assurance of critical organ auto-segmentation for pediatric craniospinal irradiation

Evaluating automatically generated normal tissue contours for safe use in head and neck and cervical cancer treatment planning

Comparative Clinical Evaluation of Deep-Learning-Based Algorithms in Auto-Segmentation of Organs-At-Risk for Head and Neck Cancers

Comprehensive evaluation of a deep learning model for automatic organs at risk segmentation on heterogeneous computed tomography images for abdominal radiotherapy

Clinical acceptability of automatically generated lymph node levels and structures of deglutition and mastication for head and neck radiation therapy

A Deep Learning Based Automatic Segmentation Approach for Anatomical Structures in Intensity Modulation Radiotherapy

Novel dosimetric validation of a commercial CT scanner based deep learning automated contour solution for prostate radiotherapy