Multifactorial Deep Learning Reveals Pan-Cancer Genomic Tumor Clusters with Distinct Immunogenomic Landscape and Response to Immunotherapy.

Feng Xie,Jianjun Zhang,Jiayin Wang,Alexandre Reuben,Wei Xu,Xin Yi,Frederick S. Varn,Yongsheng Ye,Junwen Cheng,Miao Yu,Yue Wang,Yufeng Liu,Mingchao Xie,Peng Du,Ke Ma,Xin Ma,Penghui Zhou,Shengli Yang,Yaobing Chen,Guoping Wang,Xuefeng Xia,Zhongxing Liao,John V. Heymach,Ignacio I. Wistuba,P. Andrew Futreal,Kai Ye,Chao Cheng,Tian Xia
DOI: https://doi.org/10.1158/1078-0432.ccr-19-1744
IF: 13.801
2020-01-01
Clinical Cancer Research
Abstract:Purpose: Tumor genomic features have been of particular interest because of their potential impact on the tumor immune micro-environment and response to immunotherapy. Due to the substantial heterogeneity, an integrative approach incorporating diverse molecular features is needed to characterize immunologic features underlying primary resistance to immunotherapy and for the establishment of novel predictive biomarkers. Experimental Design: We developed a pan-cancer deep machine learning model integrating tumor mutation burden, microsatellite instability, and somatic copy-number alterations to classify tumors of different types into different genomic clusters, and assessed the immune microenvironment in each genomic cluster and the association of each genomic cluster with response to immunotherapy. Results: Our model grouped 8,646 tumors of 29 cancer types from The Cancer Genome Atlas into four genomic clusters. Analysis of RNA-sequencing data revealed distinct immune microenvironment in tumors of each genomic class. Furthermore, applying this model to tumors from two melanoma immunotherapy clinical cohorts demonstrated that patients with melanoma of different genomic classes achieved different benefit from immunotherapy. Interestingly, tumors in cluster 4 demonstrated a cold immune microenvironment and lack of benefit from immunotherapy despite high microsatellite instability burden. Conclusions: Our study provides a proof for principle that deep learning modeling may have the potential to discover intrinsic statistical cross-modality correlations of multifactorial input data to dissect the molecular mechanisms underlying primary resistance to immunotherapy, which likely involves multiple factors from both the tumor and host at different molecular levels.
What problem does this paper attempt to address?