35P Exploring a Machine Learning Approach to Predict Tumour Type from Targeted Panel DNA Sequence Data

L. Xiong,B. Zhang,D. Zhang
DOI: https://doi.org/10.1016/j.annonc.2020.08.187
IF: 51.769
2020-01-01
Annals of Oncology
Abstract:Some tumour types carry specific genomic alterations, and genomic alterations across numerous different cancers may guide the inference of tumour origin. Therefore, we have explored the feasibility of identifying tumour type based on genomic alterations. To predict tumour site of origin, 11867 cases were included in this study. We constructed a random forest classifier using a training cohort of 9493 patients representing 21 cancer types, and the best parameters were determined from 5-fold cross-validation of the training data. Genomic profiling of DNA was performed on formalin-fixed paraffin-embedded tumour samples through NGS with a panel of 381 cancer-related genes. In our test set of 2374 patients, we accurately predicted tumour type in 1163 cases (49.0% of cases) based on 5-fold cross-validation. The positive predictive value was highest in tumour types with distinctive molecular profiles, or larger sample size (e.g. lung cancer). These results suggest that the application of artificial intelligence has the potential to predict tissue of origin for a tumour by using mutation data.
What problem does this paper attempt to address?