Transcriptional profiles reveal histologic origin and prognosis across 33 The Cancer Genome Atlas tumor types

Hui Xiao,Liang Hu,Qi Tan,Jinping Jia,Ping Xie,Junai Li,Minghua Wang
DOI: https://doi.org/10.21037/tcr-23-234
2023-10-31
Abstract:Background: In recent years, with the development of transcriptome sequencing, the molecular characteristics of tumors are gradually revealed. Because of the complexity of tumor transcriptome, there is a need to look for the molecular signatures which can be used to evaluate the tissue origin and cell stemness of tumors in order to promote the diagnosis and treatment of tumors. Methods: Tumor tissue-specific gene sets (TTSGs) consisting of 200 genes were selected using RNA expression data of 9,875 patients from 33 tumor types. t-distributed Stochastic Neighbor Embedding (t-SNE) was used for dimensionality reduction and visualization of TTSGs in each tumor type. To evaluate oncogenic dedifferentiation and loss of cell stemness, Euclidean distance from each sample to a human embryo single-cell RNA-seq dataset (GSE36552) of TTSGs was calculated as TTSGs index indicating dissimilarity of tumors and embryo. TTSGs index was evaluated for prognosis in each tumor type. Two published signature indexes, the mRNA signature index (mRNAsi) and CIBERSORT, were compared to assess the correlation between the TTSGs index with cell stemness and immune microenvironment. Finally, the difference of prognosis, immune microenvironment and radiotherapy outcomes were compared between patients with high and low TTSGs index. Results: In this study, all 33 tumor types in The Cancer Genome Atlas (TCGA) were embedded into isolated clusters by t-SNE and confirmed by k-nearest neighbors (kNN) algorithm. Clusters of squamous-cell carcinoma were adjacent to each other revealing similar histologic origin. Basal-like breast cancer was separated from luminal and HER-2-amplified subtypes and closed to squamous-cell carcinoma. TTSGs index was related to overall survival outcomes in cancers derived from liver, thyroid, brain, cervical and kidney. There was a positive correlation between mRNAsi and TTSGs index in pan-kidney and pan-neuronal cancers. Furthermore, cell fractions of M2 macrophages and total leukocytes increased in the group with higher TTSGs index. Patients with higher TTSGs index had longer overall survival time and less radiation therapy resistance compared to patients with lower TTSGs index. Conclusions: The signature of TTSGs is related to tumor expression features that distinguish tumors of different histologic origin using t-SNE. The signature also relates to prognosis of certain kinds of tumors.
What problem does this paper attempt to address?