Computational Identification of Preneoplastic Cells Displaying High Stemness and Risk of Cancer Progression

Tianyuan Liu, Xuan Zhao, Yuan Lin, Qi Luo, Shaosen Zhang, Yiyi Xi, Yamei Chen, Lin, Wenyi Fan, Jie Yang, Yuling Ma, Alok K. Maity, Yanyi Huang, Jianbin Wang, Jiang Chang, Dongxin Lin, Andrew E. Teschendorff, Chen Wu
DOI: https://doi.org/10.1158/0008-5472.can-22-0668
IF: 11.2
2022-01-01
Cancer Research
Abstract: Evidence points toward the differentiation state of cells as a marker of cancer risk and progression. Measuring the differentiation state of single cells in a preneoplastic population could thus enable novel strategies for early detection and risk prediction. Recent maps of somatic mutagenesis in normal tissues from young healthy individuals have revealed cancer driver mutations, indicating that these do not correlate well with differentiation state and that other molecular events also contribute to cancer development. We hypothesized that the differentiation state of single cells can be measured by estimating the regulatory activity of the transcription factors (TF) that control differentiation within that cell lineage. To this end, we present a novel computational method called CancerStemID that estimates a stemness index of cells from single-cell RNA sequencing data. CancerStemID is validated in two human esophageal squamous cell carcinoma (ESCC) cohorts, demonstrating how it can identify undifferentiated preneoplastic cells whose transcriptomic state is overrepresented in invasive cancer. Spatial transcriptomics and whole-genome bisulfite sequencing demonstrated that differentiation activity of tissue-specific TFs was decreased in cancer cells compared with the basal cell-of-origin layer and established that differentiation state correlated with differential DNA methylation at the promoters of these TFs, independently of underlying NOTCH1 and TP53 mutations. The findings were replicated in a mouse model of ESCC development, and the broad applicability of CancerStemID to other cancer types was demonstrated.
In summary, these data support an epigenetic stem-cell model of oncogenesis and highlight a novel computational strategy to identify stem-like preneoplastic cells that undergo positive selection. SIGNIFICANCE: This study develops a computational strategy to dissect the heterogeneity of differentiation states within a preneoplastic cell population, allowing identification of stem-like cells that may drive cancer progression.