Abstract:Background: Diabetic retinopathy (DR) is a prevalent microvascular complication of diabetes mellitus, and early detection is crucial for effective management. Metabolomics profiling has emerged as a promising approach for identifying potential biomarkers associated with DR progression. This study aimed to develop a hybrid explainable artificial intelligence (XAI) model for targeted metabolomics analysis of patients with DR, utilizing a focused approach to identify specific metabolites exhibiting varying concentrations among individuals without DR (NDR), those with non-proliferative DR (NPDR), and individuals with proliferative DR (PDR) who have type 2 diabetes mellitus (T2DM). Methods: A total of 317 T2DM patients, including 143 NDR, 123 NPDR, and 51 PDR cases, were included in the study. Serum samples underwent targeted metabolomics analysis using liquid chromatography and mass spectrometry. Several machine learning models, including Support Vector Machines (SVC), Random Forest (RF), Decision Tree (DT), Logistic Regression (LR), and Multilayer Perceptrons (MLP), were implemented as solo models and in a two-stage ensemble hybrid approach. The models were trained and validated using 10-fold cross-validation. SHapley Additive exPlanations (SHAP) were employed to interpret the contributions of each feature to the model predictions. Statistical analyses were conducted using the Shapiro–Wilk test for normality, the Kruskal–Wallis H test for group differences, and the Mann–Whitney U test with Bonferroni correction for post-hoc comparisons. Results: The hybrid SVC + MLP model achieved the highest performance, with an accuracy of 89.58%, a precision of 87.18%, an F1-score of 88.20%, and an F-beta score of 87.55%. SHAP analysis revealed that glucose, glycine, and age were consistently important features across all DR classes, while creatinine and various phosphatidylcholines exhibited higher importance in the PDR class, suggesting their potential as biomarkers for severe DR. Conclusion: The hybrid XAI models, particularly the SVC + MLP ensemble, demonstrated superior performance in predicting DR progression compared to solo models. The application of SHAP facilitates the interpretation of feature importance, providing valuable insights into the metabolic and physiological markers associated with different stages of DR. These findings highlight the potential of hybrid XAI models combined with explainable techniques for early detection, targeted interventions, and personalized treatment strategies in DR management.

Explainable artificial intelligence (XAI) to find optimal in-silico biomarkers for cardiac drug toxicity evaluation

A stacking ensemble machine learning model for evaluating cardiac toxicity of drugs based on in silico biomarkers

Explainable Artificial Intelligence for Pharmacovigilance: What Features Are Important When Predicting Adverse Outcomes?

Evaluation of cardiac pro-arrhythmic risks using the artificial neural network with ToR–ORd in silico model output

Drug safety assessment by machine learning models

Artificial Intelligence and Machine Learning Methods to Evaluate Cardiotoxicity following the Adverse Outcome Pathway Frameworks

Prediction of Drug-Induced TdP Risks Using Machine Learning and Rabbit Ventricular Wedge Assay

Artificial Intelligence and Machine Learning Models for Predicting Drug-Induced Kidney Injury in Small Molecules

Development of models for predicting Torsade de Pointes cardiac arrhythmias using perceptron neural networks

Artificial intelligence to assist decision-making on pharmacotherapy: A feasibility study

An intelligent biosensing platform using phase space reconstruction‐assisted convolutional neural network for drug‐induced cardiotoxicity assessment

Semi-Supervised Learning to Boost Cardiotoxicity Prediction by Mining a Large Unlabeled Small Molecule Dataset

Multi-labeled Neural Network Model for Automatically Processing Cardiomyocyte Mechanical Beating Signals in Drug Assessment.

Adaptability of AI for safety evaluation in regulatory science: A case study of drug-induced liver injury

Artificial Intelligence in Drug Toxicity Prediction: Recent Advances, Challenges, and Future Perspectives

Editorial: Pharmacogenetics.

Identification of Optimal Machine Learning Algorithms and Molecular Fingerprints for Explainable Toxicity Prediction Models Using ToxCast/Tox21 Bioassay Data

Hybrid Explainable Artificial Intelligence Models for Targeted Metabolomics Analysis of Diabetic Retinopathy

Design and standards for genetic evaluation of swine seedstock populations.

Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values

Enhanced drug classification using machine learning with multiplexed cardiac contractility assays