Risk Prediction for Non-cardiac Surgery Using the 12-Lead Electrocardiogram: An Explainable Deep Learning Approach
Carl W Harris,Anway Pimpalkar,Ataes Aggarwal,Jiyuan Yang,Xiaojian Chen,Samuel Schmidgall,Sampath Rapuri,Joseph L Greenstein,Casey Overby Taylor,Robert David Stevens
DOI: https://doi.org/10.1101/2024.11.19.24317577
2024-11-20
Abstract:Background
To improve on existing noncardiac surgery risk scores, we propose a novel approach which leverages features of the preoperative 12-lead electrocardiogram (ECG) to predict major adverse postoperative outcomes.
Methods
Data acquired in 37,060 adult patients prior to major noncardiac surgery were used to train a series of convolutional neural network models in the task of predicting in-hospital acute myocardial infarction (MI), in-hospital mortality (IHM), and a composite of in-hospital MI, in-hospital stroke, and 30-day mortality. Preoperative ECG waveform features were first modeled as sole inputs then integrated with clinical variables in fusion models. Model discrimination was evaluated using area under the receiver operating characteristic (AUROC) analysis, and performances were compared with the Revised Cardiac Risk Index (RCRI), a benchmark preoperative risk score To gain interpretable insight, a generative approach using counterfactual ECGs was implemented.
Results
The ECG fusion model had an AUROC of 0.858 (95% CI [0.845,0.872]), 0.899 (95% CI [0.889,0.908]), and 0.835 (95% CI [0.827,0.843]) in predicting MI, IHM, and the composite endpoint, respectively; these AUROC values were significantly higher than in models based on ECG waveforms alone (MI: p = 0.001, IHM: p<10^(-4), composite: p<10^(-4)). All ECG based models had significantly higher discrimination than the RCRI. Counterfactual ECG analysis revealed morphological features relevant to outcome classification.
Conclusion
A deep learning approach integrating preoperative ECG waveform features significantly enhances the ability to predict major outcomes after noncardiac surgery. The use of counterfactual ECGs provides plausible explanations for classifier decisions, increasing the interpretability of the models.