Abstract:ObjectiveDiabetes is a chronic fatal disease that has affected millions of people all over the globe. Type 2 Diabetes Mellitus (T2DM) accounts for 90% of the affected population among all types of diabetes. Millions of T2DM patients remain undiagnosed due to lack of awareness and under resourced healthcare system. So, there is a dire need for a diagnostic and prognostic tool that shall help the healthcare providers, clinicians and practitioners with early prediction and hence can recommend the lifestyle changes required to stop the progression of diabetes. The main objective of this research is to develop a framework based on machine learning techniques using only lifestyle indicators for prediction of T2DM disease. Moreover, prediction model can be used without visiting clinical labs and hospital readmissions.MethodA proposed framework is presented and implemented based on machine learning paradigms using lifestyle indicators for better prediction of T2DM disease. The current research has involved different experts like Diabetologists, Endocrinologists, Dieticians, Nutritionists, etc. for selecting the contributing 1552 instances and 11 attributes lifestyle biological features to promote health and manage complications towards T2DM disease. The dataset has been collected through survey and google forms from different geographical regions.ResultsSeven machine learning classifiers were employed namely K-Nearest Neighbour (KNN), Linear Regression (LR), Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT), Random Forest (RF) and Gradient Boosting (GB). Gradient Boosting classifier outperformed best with an accuracy rate of 97.24% for training and 96.90% for testing separately followed by RF, DT, NB, SVM, LR, and KNN as 95.36%, 92.52%, 90.72%, 90.20%, 90.20% and 77.06% respectively. However, in terms of precision, RF achieved high performance (0.980%) and KNN performed the lowest (0.793%). As far as recall is being concerned, GB achieved the highest rate of 0.975% and KNN showed the worst rate of 0.774%. Also, GB is top performed in terms of f1-score. According to the ROCs, GB and NB had a better area under the curve compared to the others.ConclusionThe research developed a realistic health management system for T2DM disease based on machine learning techniques using only lifestyle data for prediction of T2DM. To extend the current study, these models shall be used for different, large and real-time datasets which share the commonality of data with T2DM disease to establish the efficacy of the proposed system.

Balancing Acts: Tackling Data Imbalance in Machine Learning for Predicting Myocardial Infarction in Type 2 Diabetes

A machine learning-based approach for the prediction of periprocedural myocardial infarction by using routine data

An Enhanced Machine Learning Framework for Type 2 Diabetes Classification Using Imbalanced Data with Missing Values

Mitigating class imbalance in heart disease detection with machine learning

Deep Learning Based Cardiovascular Disease Risk Factor Prediction Among Type 2 Diabetes Mellitus Patients

Machine Learning Approach with Harmonized Multinational Datasets for Enhanced Prediction of Hypothyroidism in Patients with Type 2 Diabetes

Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches

Addressing Class Imbalance in Healthcare Data: Machine Learning Solutions for Age-Related Macular Degeneration and Preeclampsia

1290-P: Developing Machine Learning Model for Predicting Acute Coronary Syndrome in Type 2 Diabetes Mellitus Patients through Substitution of Propensity Scores for Binary Variables

Enhancing severe hypoglycemia prediction in type 2 diabetes mellitus through multi-view co-training machine learning model for imbalanced dataset

Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm

Machine learning-driven predictions and interventions for cardiovascular occlusions

Machine Learning Approaches for Type 2 Diabetes Prediction and Care Management

Machine learning based predictive model of Type 2 diabetes complications using Malaysian National Diabetes Registry: A study protocol

Advancements In Heart Disease Prediction: A Machine Learning Approach For Early Detection And Risk Assessment

Supervised Machine Learning based Ensemble Model for Accurate Prediction of Type 2 Diabetes

Machine Learning as a Support for the Diagnosis of Type 2 Diabetes

Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases

Percutaneous ethanol injection in the treatment of hepatocellular carcinoma in cirrhosis: a simple, effective and cheap procedure for percutaneous ablation.

Hybrid Prediction Model for Type-2 Diabetes Mellitus using Machine Learning Approach

A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus