Construction of immune-related molecular diagnostic and predictive models of hepatocellular carcinoma based on machine learning

Hui Zheng,Xu Han,Qian Liu,Li Zhou,Yawen Zhu,Jiaqi Wang,Wenjing Hu,Fengcai Zhu,Ran Liu
DOI: https://doi.org/10.1016/j.heliyon.2024.e24854
IF: 3.776
2024-01-17
Heliyon
Abstract:Background: To exploit hepatocellular carcinoma (HCC) diagnostic substances, we identify potential predictive markers based on machine learning and to explore the significance of immune cell infiltration in this pathology. Method: Three HCC gene expression datasets were used for weighted gene co-expression network analysis (WGCNA) and differential expression analysis. Least Absolute Shrinkage and Selection Operator (LASSO) and Random Forest were applied to identify candidate biomarkers. The diagnostic value of HCC diagnostic gene biomarkers was further assessed by the area under the ROC curve observed in the validation dataset. CIBERSORT was used to analyze 22 immune cell fractions from HCC patients and to analyze their correlation with diagnostic markers. In addition, the prognostic value of the markers and the sensitivity of the drugs were analyzed. Result: WGCNA and differential expression analysis were used to screen 396 distinct gene signatures in HCC tissues. They were mostly engaged in cytoplasmic fusion and the cell division cycle, according to gene enrichment analyses. Five genes were shown to have a high diagnostic value for use as diagnostic biomarkers for HCC, including EFHD1 (AUC = 0.77), KIF4A (AUC = 0.97), UBE2C (AUC = 0.96), SMYD3 (AUC = 0.91), and MCM7 (AUC = 0.93). T cells, NK cells, macrophages, and dendritic cells were found to be related to diagnostic markers in HCC tissues by immune cell infiltration analysis, indicating that these cells are intimately linked to the onset and spread of HCC. Concurrently, these five genes and their constructed models have considerable prognostic value. Conclusion: These five genes (EFHD1, KIF4A, UBE2C, SMYD3, and MCM7) may serve as new candidate molecular markers for HCC, providing new insights for future diagnosis, prognosis, and molecular therapy of HCC.
What problem does this paper attempt to address?