Lung Tumor Segmentation of PET/CT Using Dual Pyramid Mask R-CNN

Tonghe Wang,Yang Lei,Sibo Tian,Tian Liu,Walter J. Curran,Kristin Higgins,Xiaofeng Yang
DOI: https://doi.org/10.1117/12.2580987
2021-01-01
Abstract:We propose a learning-based method to automatically segment lung tumor from positron emission tomography (PET)/computed tomography (CT) images. A dual pyramid mask R-CNN is introduced to enable end-to-end segmentation. To avoid the effect of useless region, mask R-CNN is used to get rid of non-tumor regions via fist locate the tumor region-of-interest (ROI) and then segment tumor within that ROI. Dual pyramid networks are used as backbone in mask R-CNN to extract comprehensive features from both CT and PET images. The binary mask of tumor of an arrival patient’s CT and PET image is generated by the welltrained network. To evaluate the proposed method, we retrospectively investigate 42 lung PET/CT datasets. On each dataset, lung tumor was delineated by physicians and was served as ground truth and training target. The proposed method was trained and evaluated by a six-fold cross validation strategy. The average centroid distance, volume difference and DSC value among all 42 datasets are 0.496±0.933mm, 0.303±0.458cc and 0.894±0.080, which indicates that the proposed method is able to generate target contour within 0.5mm error in displacement, 0.3cc error in volume size and around 90% overlapping compared with ground truth. The proposed method has great potential in improving the efficiency and mitigating the observer-dependence in tumor detection and delineation for diagnosis and therapy.
What problem does this paper attempt to address?