Abstract:Type 2 diabetes mellitus (T2DM) is associated with accelerated development of atherosclerosis and a reduced life expectancy. Using machine learning (ML) to identify novel characteristics from electronic medical records (EMRs) associated with prognosis may increase prognostic precision and find new targets for investigation and treatment. We used a novel ML approach to investigate the use of EMRs to predict all-cause mortality in two ethnically and geographically different populations; one from the West of Scotland (WoS) and the other from Hong Kong. We obtained EMRs including demographics, prior comorbidities, laboratory measurements, medications, and mortality. Multivariable Cox regression and time-dependent random forest model were used to identify predictors of all-cause mortality in patients with T2DM. Subsequently, we applied a state-of-the-art ML interpretability method, to gain further insight into the key predictors. In WoS, 46,031 individuals received a new diagnosis of T2DM between 2009 and 2019. Their median age was 66 (IQR: 56 to 75) years. Within 10 years, 11,727 (25%) deaths were recorded. In Hong Kong, 273,876 patients with a first-attendance with T2DM at public hospitals or clinics were included, with follow-up until December 2019. The median age of the patients was 64 (IQR: 57 to 72) years. Within 10 years, 91,155 (33%) deaths were recorded. For both cohorts, the strongest predictor for all-cause mortality was prescription of loop diuretics (Figure 1). For the WoS, other important predictors were greater age, lower serum albumin, elevated alanine transaminase (ALT), increased alkaline phosphatase (ALP), and lower estimated glomerular function rate (eGFR) (c-index: 0.83; Brier score: 0.07). For Hong Kong, predictive variables were remarkably similar and included greater age, lower eGFR, lower haemoglobin and lymphocytes, lower serum albumin, and elevated ALP (c-index: 0.85; Brier score: 0.06). Multivariable Cox regression adjusting for age and sex showed a higher mortality amongst those prescribed loop diuretics compared to those who were not (WoS: hazard ratio: 1.549, 95% CI: 1.521 to 1.575; Hong Kong: hazard ratio: 1.745 (95% CI: 1.721 to 1.769). Only a minority of patients prescribed loop diuretics had a diagnosis of heart failure, end-stage renal disease or resistant hypertension. Predictors of all-cause Mortality

Machine learning and statistical models to predict all-cause mortality in type 2 diabetes: Results from the UK Biobank study

Predicting mortality of type-2 diabetes mellitus by applying machine learning to electronic medical records in the west of scotland and hong kong

Prognostic Machine Learning Models for First-Year Mortality in Incident Hemodialysis Patients: Development and Validation Study.

Machine Learning for the Prediction of First-Year Mortality in Incident Hemodialysis Patients

Improving Cardiovascular Risk Prediction Through Machine Learning Modelling of Irregularly Repeated Electronic Health Records

Comparison of Machine Learning Techniques for Mortality Prediction in a Prospective Cohort of Older Adults

Developing a prediction model for all‐cause mortality risk among patients with type 2 diabetes mellitus in Shanghai, China

Comparing the accuracy of four machine learning models in predicting type 2 diabetes onset within the Chinese population: a retrospective study

Machine Learning Models in Type 2 Diabetes Risk Prediction: Results from a Cross-sectional Retrospective Study in Chinese Adults

Interpretable machine learning models for the prediction of all‐cause mortality and time to death in hemodialysis patients

Machine learning-based models to predict one-year mortality among Chinese older patients with coronary artery disease combined with impaired glucose tolerance or diabetes mellitus

Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study

A 10-year retrospective cohort of diabetic patients in a large medical institution: Utilizing multiple machine learning models for diabetic kidney disease prediction

An enhanced machine learning algorithm for type 2 diabetes prognosis with a detailed examination of Key correlates

Machine Learning-Based Predictive Model for Mortality in Female Breast Cancer Patients Considering Lifestyle Factors

Comparison of prediction models for cardiovascular and mortality risk in people with type 2 diabetes: An external validation in 23 685 adults included in the UK Biobank

Machine learning predicts long-term mortality after acute myocardial infarction using systolic time intervals and routinely collected clinical data

Machine learning for the prediction of atherosclerotic cardiovascular disease during 3-year follow up in Chinese type 2 diabetes mellitus patients

Development of machine learning-based models to predict 10-year risk of cardiovascular disease: a prospective cohort study

Identifying top ten predictors of type 2 diabetes through machine learning analysis of UK Biobank data

Unveiling the Hidden Burden: Estimating All-Cause Mortality Risk in Older Individuals with Type 2 Diabetes