Assessment of EMR ML Mining Methods for Measuring Association between Metal Mixture and Mortality for Hypertension

Site Xu,Mu Sun
DOI: https://doi.org/10.1007/s40292-024-00666-w
Abstract:Introduction: There are limited data available regarding the connection between heavy metal exposure and mortality among hypertension patients. Aim: We intend to establish an interpretable machine learning (ML) model with high efficiency and robustness that monitors mortality based on heavy metal exposure among hypertension patients. Methods: Our datasets were obtained from the US National Health and Nutrition Examination Survey (NHANES, 2013-2018). We developed 5 ML models for mortality prediction among hypertension patients by heavy metal exposure, and tested them by 10 discrimination characteristics. Further, we chose the optimally performing model after parameter adjustment by genetic algorithm (GA) for prediction. Finally, in order to visualize the model's ability to make decisions, we used SHapley Additive exPlanation (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME) algorithm to illustrate the features. The study included 2347 participants in total. Results: A best-performing eXtreme Gradient Boosting (XGB) with GA for mortality prediction among hypertension patients by 13 heavy metals was selected (AUC 0.959; 95% CI 0.953-0.965; accuracy 96.8%). According to sum of SHAP values, cadmium (0.094), cobalt (2.048), lead (1.12), tungsten (0.129) in urine, and lead (2.026), mercury (1.703) in blood positively influenced the model, while barium (- 0.001), molybdenum (- 2.066), antimony (- 0.398), tin (- 0.498), thallium (- 2.297) in urine, and selenium (- 0.842), manganese (- 1.193) in blood negatively influenced the model. Conclusions: Hypertension patients' mortality associated with heavy metal exposure was predicted by an efficient, robust, and interpretable GA-XGB model with SHAP and LIME. Cadmium, cobalt, lead, tungsten in urine, and mercury in blood are positively correlated with mortality, while barium, molybdenum, antimony, tin, thallium in urine, and lead, selenium, manganese in blood is negatively correlated with mortality.
What problem does this paper attempt to address?