Single-cell RNA-seq Analysis to Identify Potential Biomarkers for Diagnosis, and Prognosis of Non-Small Cell Lung Cancer by Using Comprehensive Bioinformatics Approaches.

Adiba Sultana, Md Shahin Alam, Xingyun Liu, Rohit Sharma, Rajeev K. Singla, Rohit Gundamaraju, Bairong Shen
DOI: https://doi.org/10.1016/j.tranon.2022.101571
IF: 4.803
2023-01-01
Translational Oncology
Abstract: Non-small cell lung cancer (NSCLC) is the most common type of lung cancer and the leading cause of cancer-related deaths worldwide. Identifying gene biomarkers, their regulatory factors, and their signaling pathways is essential to reveal the molecular mechanisms of NSCLC initiation and progression. Thus, the goal of this study is to identify gene biomarkers for NSCLC diagnosis and prognosis from single-cell RNA sequencing (scRNA-seq) data using bioinformatics techniques. scRNA-seq data were obtained from the Gene Expression Omnibus (GEO) database to identify differentially expressed genes (DEGs). A total of 158 DEGs (48 upregulated and 110 downregulated) were detected after gene integration. Gene Ontology enrichment and KEGG pathway analysis of the DEGs were performed with FunRich software. A protein-protein interaction (PPI) network of the DEGs was then constructed using the STRING database and visualized with Cytoscape software. We identified 12 key genes (KGs), including MS4A1, CCL5, and GZMB, by applying two topological methods to the PPI network. The diagnostic, expression, and prognostic potential of the 12 KGs was assessed using receiver operating characteristic (ROC) curves and the web-based tool SurvExpress. From the regulatory network analysis, we extracted 7 key transcription factors (TFs) (FOXC1, YY1, CEBPB, TFAP2A, SREBF2, RELA, and GATA2) and 8 key miRNAs (hsa-miR-124-3p, hsa-miR-34a-5p, hsa-miR-21-5p, hsa-miR-155-5p, hsa-miR-449a, hsa-miR-24-3p, hsa-let-7b-5p, and hsa-miR-7-5p) associated with the KGs. Functional enrichment and pathway analysis, survival analysis, ROC analysis, and regulatory network analysis highlighted crucial roles of the key genes. The identified key genes might serve as candidate biomarkers for NSCLC diagnosis and prognosis.
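The key-gene selection step can be illustrated with a minimal sketch. The abstract does not name the two topological methods, so degree and closeness centrality are used here purely as an assumption, and the edge list is toy data with placeholder node names (G1 to G6), not the STRING network from the study. A node is reported as "key" only if it ranks in the top k under both measures.

```python
from collections import deque

def build_adjacency(edges):
    """Turn an undirected edge list into a node -> set-of-neighbours map."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj

def degree_centrality(adj):
    """Number of direct interaction partners of each node."""
    return {node: len(neigh) for node, neigh in adj.items()}

def closeness_centrality(adj):
    """Reachable-node count divided by the sum of shortest-path distances (BFS)."""
    scores = {}
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total = sum(dist.values())
        scores[source] = (len(dist) - 1) / total if total else 0.0
    return scores

def top_k(scores, k):
    """Names of the k highest-scoring nodes; ties are broken alphabetically."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return {name for name, _ in ranked[:k]}

def key_nodes(edges, k):
    """Nodes ranked in the top k by BOTH assumed topological measures."""
    adj = build_adjacency(edges)
    return top_k(degree_centrality(adj), k) & top_k(closeness_centrality(adj), k)

# Toy, hypothetical interaction edges (not data from the paper).
TOY_EDGES = [
    ("G1", "G2"), ("G1", "G3"), ("G1", "G4"),
    ("G2", "G3"), ("G4", "G5"), ("G5", "G6"),
]

if __name__ == "__main__":
    print(key_nodes(TOY_EDGES, k=2))
```

Taking the intersection of two rankings is one common way to make hub selection less sensitive to the choice of a single centrality measure; the study itself may have combined its two methods differently.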