A radiogenomic dataset of non-small cell lung cancer

Shaimaa Bakr, Olivier Gevaert, Sebastian Echegaray, Kelsey Ayers, Mu Zhou, Majid Shafiq, Hong Zheng, Jalen Anthony Benson, Weiruo Zhang, Ann N. C. Leung, Michael Kadoch, Chuong D. Hoang, Joseph Shrager, Andrew Quon, Daniel L. Rubin, Sylvia K. Plevritis, Sandy Napel
DOI: https://doi.org/10.1038/sdata.2018.202
2018-01-01
Scientific Data
Abstract: Medical image biomarkers of cancer promise improvements in patient care through advances in precision medicine. Compared to genomic biomarkers, image biomarkers provide the advantages of being non-invasive and of characterizing a heterogeneous tumor in its entirety, as opposed to the limited tissue available via biopsy. We developed a unique radiogenomic dataset from a Non-Small Cell Lung Cancer (NSCLC) cohort of 211 subjects. The dataset comprises Computed Tomography (CT), Positron Emission Tomography (PET)/CT images, semantic annotations of the tumors as observed on the medical images using a controlled vocabulary, and segmentation maps of tumors in the CT scans. Imaging data are also paired with results of gene mutation analyses, gene expression microarrays and RNA sequencing data from samples of surgically excised tumor tissue, and clinical data, including survival outcomes. This dataset was created to facilitate the discovery of the underlying relationship between tumor molecular and medical image features, as well as the development and evaluation of prognostic medical image biomarkers.