Risk Assessment of Cardiovascular Disease Based on SOLSSA-CatBoost Model

Xi Wei, Congjun Rao, Xinping Xiao, Lin Chen, Mark Goh
DOI: https://doi.org/10.1016/j.eswa.2023.119648
IF: 8.5
2023-02-05
Expert Systems with Applications
Abstract: Cardiovascular disease (CVD) has become a significant public health problem that affects national economic and social development, and it ranks among the leading causes of death worldwide. As a result, the prevention, control, and risk assessment of CVD are receiving increasing attention. In this paper, an improved sparrow search algorithm (SSA) is designed to optimize the parameters of the Categorical Boosting (CatBoost) model, and the resulting model is applied to the risk assessment of CVD. The main contributions are as follows: (1) the salp swarm algorithm is integrated into the position update formula of the discoverer, and the global optimal solution of the previous generation is added, to improve both the global search ability and the local development ability of SSA; (2) Opposition-based Learning (OBL) and a lateral mutation strategy are used to improve the search ability of the worst individual; (3) the sparrow search algorithm based on the salp swarm algorithm, OBL, and the lateral mutation strategy (SOLSSA) is used to optimize the parameters of CatBoost to improve its predictive performance, and experiments are carried out with the proposed model (SOLSSA-CatBoost) on two CVD data sets from Kaggle. The proposed model is compared with six machine learning models: random forest (RF), logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM), light gradient boosting (LGB), and CatBoost. It is also compared with four other optimization algorithms, namely the whale optimization algorithm (WOA), the gray wolf algorithm (GWO), the seagull optimization algorithm (SOA), and SSA, in terms of how well each optimizes the performance of CatBoost. The experimental results show that SOLSSA-CatBoost achieves better predictive performance on the test set than the comparison algorithms, with F1-scores of 90% and 81.51% on the two CVD data sets, respectively.
The SOLSSA-CatBoost model proposed in this paper can predict patients' disease risk more accurately and can provide a basis for doctors in assessing a patient's condition.
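
As an illustration of the opposition-based learning idea mentioned in contribution (2), the following is a minimal sketch and not the authors' implementation. It uses the standard OBL reflection x' = lower + upper - x and applies it to the worst individual of a small population. The abstract does not give the paper's exact update rule or the lateral mutation step, so the greedy acceptance rule, the toy objective `sphere`, and the helper names `opposite_point` and `refresh_worst` are all assumptions made for this example.

```python
def sphere(x):
    """Toy objective to minimize (a stand-in for a validation loss)."""
    return sum(v * v for v in x)


def opposite_point(x, lower, upper):
    """Standard OBL reflection: mirror each coordinate within its bounds."""
    return [lo + hi - v for v, lo, hi in zip(x, lower, upper)]


def refresh_worst(population, objective, lower, upper):
    """Replace the worst individual by its opposite point if that is better.

    The greedy acceptance rule is an assumption, not taken from the paper.
    Returns the index of the individual that was examined.
    """
    worst_idx = max(range(len(population)),
                    key=lambda i: objective(population[i]))
    candidate = opposite_point(population[worst_idx], lower, upper)
    if objective(candidate) < objective(population[worst_idx]):
        population[worst_idx] = candidate
    return worst_idx


# Hypothetical search bounds and population, chosen only for illustration.
lower, upper = [-5.0, -5.0], [10.0, 10.0]
population = [[1.0, 2.0], [9.0, 8.0], [-1.0, 0.5]]
idx = refresh_worst(population, sphere, lower, upper)
print(idx, population[idx])
```

In a hyperparameter search, each coordinate would correspond to one CatBoost parameter and the objective to a cross-validated score; that mapping is likewise an assumption here.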
Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; Operations Research & Management Science