Pseudo-grading of tumor subpopulations from single-cell transcriptomic data using Phenotype Algebra

Namrata Bhattacharya,Anja Rockstroh,Sanket Suhas Deshpande,Sam Koshy Thomas,Anunay Yadav,Chitrita Goswami,Smriti Chawla,Pierre Solomon,Cynthia Fourgeux,Gaurav Ahuja,Brett G. Hollier,Himanshu Kumar,Antoine Roquilly,Jeremie Poschmann,Melanie Lehman,Colleen C. Nelson,Debarka Sengupta
DOI: https://doi.org/10.1101/2022.12.28.522060
2024-04-01
Abstract:Single-cell RNA-sequencing (scRNA-seq) coupled with robust computational analysis facilitates the characterization of phenotypic heterogeneity within tumors. Current scRNA-seq analysis pipelines are capable of identifying a myriad of malignant and non-malignant cell subtypes from single-cell profiling of tumors. However, given the extent of intra-tumoral heterogeneity, it is challenging to assess the risk associated with individual malignant cell subpopulations, primarily due to the complexity of the cancer phenotype space and the lack of clinical annotations associated with tumor scRNA-seq studies. To this end, we introduce SCellBOW, a scRNA-seq analysis framework inspired by document embedding techniques from the domain of Natural Language Processing (NLP). SCellBOW is a novel computational approach that facilitates effective identification and high-quality visualization of single-cell subpopulations. We compared SCellBOW with existing best practice methods for its ability to precisely represent phenotypically divergent cell types across multiple scRNA-seq datasets, including our in-house generated human splenocyte and matched peripheral blood mononuclear cell (PBMC) dataset. For malignant cells, SCellBOW estimates the relative risk associated with each cluster and stratifies them based on their aggressiveness. This is achieved by simulating how the presence or absence of a specific malignant cell subpopulation influences disease prognosis. Using SCellBOW, we identified a hitherto unknown and pervasive AR−/NE (androgen-receptor-negative, neuroendocrine-low) malignant subpopulation in metastatic prostate cancer with conspicuously high aggressiveness. Overall, the risk-stratification capabilities of SCellBOW hold promise for formulating tailored therapeutic interventions by identifying clinically relevant tumor subpopulations and their impact on prognosis.
Bioinformatics
What problem does this paper attempt to address?