Identifying Metabolic Syndrome Easily and Cost Effectively Using Non-Invasive Methods with Machine Learning Models
Wei Xu,Zikai Zhang,Kerong Hu,Ping Fang,Ran Li,Dehong Kong,Miao Xuan,Yang Yue,Dunmin She,Ying Xue
DOI: https://doi.org/10.2147/DMSO.S413829
2023-07-18
Abstract:Wei Xu, 1, &ast Zikai Zhang, 2, &ast Kerong Hu, 1, &ast Ping Fang, 1 Ran Li, 1 Dehong Kong, 1 Miao Xuan, 1 Yang Yue, 3 Dunmin She, 4, 5 Ying Xue 1 1 Department of Endocrinology and Metabolism, Tongji Hospital, School of Medicine, Tongji University, Shanghai, People's Republic of China; 2 Department of Oncology, Tongji Hospital, School of Medicine, Tongji University, Shanghai, People's Republic of China; 3 School of Mathematics and Statistics, University of Melbourne, Melbourne, Australia; 4 Clinical Medical College, Yangzhou University, Yangzhou, Jiangsu, People's Republic of China; 5 Department of Endocrinology, Northern Jiangsu People's Hospital Affiliated to Yangzhou University, Yangzhou, Jiangsu, People's Republic of China &astThese authors contributed equally to this work Correspondence: Ying Xue, Department of Endocrinology and Metabolism, Tongji Hospital, School of Medicine, Tongji University, No. 389, Xincun Road, Shanghai, People's Republic of China, Tel +86-21-66111061, Email Dunmin She, Clinical Medical College, Yangzhou University, No. 98, Nantong West Road, Yangzhou, Jiangsu, People's Republic of China, Email Purpose: The objective of this study was to employ machine learning (ML) models utilizing non-invasive factors to achieve early and low-cost identification of MetS in a large physical examination population. Patients and Methods: The study enrolled 9171 participants who underwent physical examinations at Northern Jiangsu People's Hospital in 2009 and 2019, to determine MetS based on criteria established by the Chinese Diabetes Society. Non-invasive characteristics such as gender, age, body mass index (BMI), systolic blood pressure (SBP), and diastolic blood pressure (DBP) were collected and used as input variables to train and evaluate ML models for MetS identification. Several ML models were used for MetS identification, including logistic regression (LR), k-nearest neighbors algorithm (k-NN), naive bayesian (NB), decision tree (DT), random forest (RF), artificial neural network (ANN), and support vector machine (SVM). Results: Our ML models all showed good performance in the 10-fold cross-validation except for the SVM model. In the external validation, the NB model exhibited the best performance with an AUC of 0.976, accuracy of 0.923, sensitivity of 98.32%, and specificity of 91.32%. Conclusion: This study proposed a new non-invasive method for early and low-cost identification of MetS by using ML models. This approach has the potential to serve as a highly sensitive, convenient, and cost-effective tool for large-scale MetS screening. Keywords: metabolic syndrome, machine learning methods, non-invasive method, naive Bayesian Metabolic syndrome (MetS) refers to a set of combined metabolic disorders, including obesity, hypertension (HBP), hyperglycemia, and dyslipidemia such as elevated triglycerides (TG), and reduced high-density lipoprotein cholesterol (HDL-C), 1 which is one of the vital risk factors for accelerating progression of cardiovascular disease (CVD), 2 chronic kidney disease (CKD) 3 and type 2 diabetes (T2DM). 4–6 Several criteria defining MetS in adults have been established by expert panels from organizations such as the National Cholesterol Education Program's Adult Treatment Panel III (NCEP: ATP III), 4 the World Health Organization (WHO), 7 the European Group for the Study of Insulin Resistance, 8 the International Diabetes Federation (IDF) 1 and the Chinese Diabetes Society. 9 However, there is still no globally accepted definition of MetS. MetS is a growing global health concern, 10 with an estimated number of affected individuals exceeding one billion and continuing to rise. 11 In a study conducted in the United States, a weighted prevalence of MetS was found to be 34.7% among 17,048 adults, increasing to 48.6% among those aged 60 years or older. 12 In another survey of 8814 individuals in the United States, the prevalence of MetS was over 40% among those aged 60–69 years. 13 The prevalence of MetS in the Chinese population is relatively lower compared to populations in developed countries. In a survey of 2975 subjects in China, the prevalence of MetS was found to be 12.6%. 14 A cross-sectional survey conducted in 2001 in 31 Chinese provinces and cities found the standardized prevalence of MetS was 13.7% (9.8% for men and 17.8% for women). 15 It is crucial to find ways to early diagnose patients at high risk for MetS, all -Abstract Truncated-
endocrinology & metabolism