Enhancing Clinical Validation for Early Cardiovascular Disease Prediction through Simulation, AI, and Web Technology

Md Abu Sufian,Wahiba Hamzi,Sadia Zaman,Lujain Alsadder,Boumediene Hamzi,Jayasree Varadarajan,Md Abul Kalam Azad
DOI: https://doi.org/10.3390/diagnostics14121308
IF: 3.6
2024-06-21
Diagnostics
Abstract:Cardiovascular diseases (CVDs) remain a major global health challenge and a leading cause of mortality, highlighting the need for improved predictive models. We introduce an innovative agent-based dynamic simulation technique that enhances our AI models' capacity to predict CVD progression. This method simulates individual patient responses to various cardiovascular risk factors, improving prediction accuracy and detail. Also, by incorporating an ensemble learning model and interface of web application in the context of CVD prediction, we developed an AI dashboard-based model to enhance the accuracy of disease prediction and provide a user-friendly app. The performance of traditional algorithms was notable, with Ensemble learning and XGBoost achieving accuracies of 91% and 95%, respectively. A significant aspect of our research was the integration of these models into a streamlit-based interface, enhancing user accessibility and experience. The streamlit application achieved a predictive accuracy of 97%, demonstrating the efficacy of combining advanced AI techniques with user-centered web applications in medical prediction scenarios. This 97% confidence level was evaluated by Brier score and calibration curve. The design of the streamlit application facilitates seamless interaction between complex ML models and end-users, including clinicians and patients, supporting its use in real-time clinical settings. While the study offers new insights into AI-driven CVD prediction, we acknowledge limitations such as the dataset size. In our research, we have successfully validated our predictive proposed methodology against an external clinical setting, demonstrating its robustness and accuracy in a real-world fixture. The validation process confirmed the model's efficacy in the early detection of CVDs, reinforcing its potential for integration into clinical workflows to aid in proactive patient care and management. Future research directions include expanding the dataset, exploring additional algorithms, and conducting clinical trials to validate our findings. This research provides a valuable foundation for future studies, aiming to make significant strides against CVDs.
medicine, general & internal
What problem does this paper attempt to address?
The main goal of this paper is to enhance the clinical validation of early prediction of cardiovascular diseases (CVD) by combining dynamic simulation techniques, artificial intelligence (AI) models, and web technologies. Specifically, the research team developed an innovative agent-based dynamic simulation technique to improve the ability of AI models to predict the progression of CVD. This approach can simulate individual patients' different responses to various cardiovascular risk factors, thereby improving the accuracy and detail of predictions. Below is an overview of the key issues this study attempts to address: 1. **Research Questions and the Importance of Early Detection**: - Managing diverse patient data to ensure accurate diagnosis (data diversity management). - Identifying important features that affect diagnostic accuracy (feature selection). - Optimizing these features to better indicate the presence of disease (feature engineering). - Addressing data bias from non-disease indicators (data imbalance). - Implementing strategies to ensure balanced data representation and reduce diagnostic bias (bias correction). 2. **Research Objectives**: - Identify and analyze key features that significantly impact cardiovascular disease outcomes. - Develop and implement strategies to maintain class balance in predictive modeling. - Evaluate and compare the performance of different machine learning (ML) models in predicting cardiovascular diseases. - Assess user trust and acceptance of AI-driven diagnostic tools. - Explore the possibility of integrating AI and dynamic simulation into real-time platforms for continuous health monitoring and risk assessment. 3. **Research Methods**: - Use agent-based models (ABM) to simulate the progression of CVD in individual patients. - Employ information gain feature selection techniques to identify key features. - Address data imbalance issues using Synthetic Minority Over-sampling Technique (SMOTE). - Construct five classical models (XGBoost, logistic regression, random forest, ensemble learning, decision tree), with reasons for selecting these models including performance, speed, interpretability, and ability to handle high-dimensional data. 4. **Experimental Analysis**: - Data exploration revealed the distribution of the target variable, showing a class imbalance issue in the dataset. - Demographic analysis showed a male dominance (68%), with females accounting for 31.68%. - Analysis of the trend of heart disease frequency with age. In summary, this study aims to improve the accuracy of early prediction of cardiovascular diseases by integrating dynamic simulation, machine learning techniques, and web applications, with detailed methodology and experimental design for practical application.