An Integrative Machine Learning Approach in Detecting Gene Networks Predicting Liver Cancer Incidence and Survival

Stephan,Murshalina,Md Tamzid Islam,Mihaela Sardiu,Yanming Li
DOI: https://doi.org/10.17161/kjm.vol17.22665
2024-08-27
Kansas Journal of Medicine
Abstract:Introduction. Liver cancer (LC) is one of the most common types of cancer and the third largest cause of cancer-related deaths globally. Although there is no known cause of LC, the disease has some associated risk factors. Studies have also linked genetic mutations to LC. Patients who receive early diagnosis have a five-year relative survival rate of ~36%, whereas ~3% for late diagnoses.Methods. In this study, we employed a novel machine learning technique called netLDA which initially identified predictive strong DE genes and used as them hubs genes to identify coregulating genes that may only exhibit marginally weak DE effects. The detected genes form predictive gene networks (PGNs), which are used to make predictions.Results. Our study detected previously identified LC occurrence susceptibility strong genes; GPC3, STMN1 and TCF19. Potential biomarker genes for LC occurrence such as MT1E and PLVAP were detected as co-regulating weak genes. In our one-year survival study, SPP1 and ADH4 were detected as strong genes whilst MFAP4 was identified as novel co-regulating weak gene. The PGNs achieved an AUC of about 1.00 in our case-control study and 0.77 in our one-year survival study. Some of the detected genes overlapped with some significant LC related pathways to provide additional layers of biological significance.Conclusions. With our study, the identified predictive genes can propel studies into understanding the underlying mechanisms for LC development and prognosis. These identified biomarker genes can be used as new therapeutic targets by industries in the development of new drugs for LC.
What problem does this paper attempt to address?