Abstract:Background: Coronary artery disease (CAD) is still a lethal disease worldwide. This study aims to identify clinically relevant diagnostic biomarker in CAD and explore the potential medications on CAD. Methods: GSE42148, GSE180081, and GSE12288 were downloaded as the training and validation cohorts to identify the candidate genes by constructing the weighted gene co-expression network analysis. Functional enrichment analysis was utilized to determine the functional roles of these genes. Machine learning algorithms determined the candidate biomarkers. Hub genes were then selected and validated by nomogram and the receiver operating curve. Using CIBERSORTx, the hub genes were further discovered in relation to immune cell infiltrability, and molecules associated with immune active families were analyzed by correlation analysis. Drug screening and molecular docking were used to determine medications that target the four genes. Results: There were 191 and 230 key genes respectively identified by the weighted gene co-expression network analysis in two modules. A total of 421 key genes found enriched pathways by functional enrichment analysis. Candidate immune-related genes were then screened and identified by the random forest model and the eXtreme Gradient Boosting algorithm. Finally, four hub genes, namely, CSF3R, EED, HSPA1B, and IL17RA, were obtained and used to establish the nomogram model. The receiver operating curve, the area under curve, and the calibration curve were all used to validate the accuracy and usefulness of the diagnostic model. Immune cell infiltrating was examined, and CAD patients were then divided into high- and low-expression groups for further gene set enrichment analysis. Through targeting the hub genes, we also found potential drugs for anti-CAD treatment by using the molecular docking method. Conclusions: CSF3R, EED, HSPA1B, and IL17RA are potential diagnostic biomarkers for CAD. CAD pathogenesis is greatly influenced by patterns of immune cell infiltration. Promising drugs offers new prospects for the development of CAD therapy.

Improving generalization of machine learning-identified biomarkers using causal modelling with examples from immune receptor diagnostics

Improving generalization of machine learning-identified biomarkers with causal modeling: an investigation into immune receptor diagnostics

Causal Inference and Counterfactual Prediction in Machine Learning for Actionable Healthcare

The benefits and pitfalls of machine learning for biomarker discovery

Improving the accuracy of medical diagnosis with causal machine learning

Causal machine learning for healthcare and precision medicine

Disease diagnostics using machine learning of immune receptors

Causal Representation Learning from Multimodal Biological Observations

Machine learning for precision diagnostics of autoimmunity

Correct deconfounding enables causal machine learning for precision medicine and beyond

The immuneML ecosystem for machine learning analysis of adaptive immune receptor repertoires

Machine Learning Driven Biomarker Selection for Medical Diagnosis

Causality Refined Diagnostic Prediction

Development of a graphical model of causal gene regulatory networks using medical big data and Bayesian machine learning

Large Language Models as Co-Pilots for Causal Inference in Medical Studies

Causal modeling in large-scale data to improve identification of adults at risk for combined and common variable immunodeficiencies

Causal machine learning for predicting treatment outcomes

EAACI Guidelines on environmental science in allergic diseases and asthma – leveraging artificial intelligence and machine learning to develop a causality model in exposomics

Identification of diagnostic biomarkers and immune cell infiltration in coronary artery disease by machine learning, nomogram, and molecular docking

Improving Model's Interpretability and Reliability using Biomarkers