Detection of Cardiovascular Diseases Using Data Mining Approaches: Application of an Ensemble-Based Model

Mojdeh Nazari, Hassan Emami, Reza Rabiei, Azamossadat Hosseini, Shahabedin Rahmatizadeh
DOI: https://doi.org/10.1007/s12559-024-10306-z
IF: 4.89
2024-05-31
Cognitive Computation
Abstract: Cardiovascular diseases are the leading cause of mortality worldwide. Accurate prediction of cardiovascular disease is crucial, and machine learning and data mining techniques could facilitate decision-making and improve predictive capabilities. This study aimed to present a model for accurately predicting cardiovascular diseases and identifying the contributing factors with the greatest impact. The Cleveland dataset and a locally collected dataset, called the Noor dataset, were used in this study. Various data mining techniques and four ensemble learning-based models were implemented on both datasets. Moreover, a novel model for combining individual classifiers in ensemble learning, in which a genetic algorithm assigns a weight to each classifier, was developed. The predictive strength of each feature was also investigated to ensure the generalizability of the outcomes. The final ensemble-based model achieved a precision of 88.05% and 90.12% on the Cleveland and Noor datasets, respectively, demonstrating its reliability and suitability for future research in predicting the likelihood of cardiovascular diseases. The proposed model not only introduces an innovative approach for specifying cardiovascular diseases by unraveling the intricate relationships between various biological variables but also facilitates their early detection.
computer science, artificial intelligence, neurosciences
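
The abstract describes combining individual classifiers with weights assigned by a genetic algorithm but gives no implementation details. The following is a minimal sketch of that general idea, not the authors' method. It assumes soft voting over each classifier's positive-class probability, accuracy as the fitness function, tournament selection, uniform crossover, Gaussian mutation, and elitism. All function names, parameters, and the toy probabilities are hypothetical and are not drawn from the Cleveland or Noor datasets.

```python
import random


def weighted_vote(probs, weights):
    """Soft voting: threshold the weighted mean of positive-class probabilities."""
    total = sum(weights)
    preds = []
    for i in range(len(probs[0])):
        score = sum(w * p[i] for w, p in zip(weights, probs)) / total
        preds.append(1 if score >= 0.5 else 0)
    return preds


def accuracy(preds, labels):
    """Fraction of predictions that match the labels."""
    return sum(int(p == y) for p, y in zip(preds, labels)) / len(labels)


def evolve_weights(probs, labels, pop_size=20, generations=30, seed=0):
    """Search for classifier weights with a simple genetic algorithm (assumed design)."""
    rng = random.Random(seed)
    k = len(probs)

    def fitness(w):
        return accuracy(weighted_vote(probs, w), labels)

    # Start from equal weights plus random candidates.
    population = [[1.0] * k] + [
        [rng.uniform(0.01, 1.0) for _ in range(k)] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        next_gen = ranked[:2]  # elitism: carry the two best candidates forward
        while len(next_gen) < pop_size:
            # Tournament selection of two parents.
            a = max(rng.sample(ranked, 3), key=fitness)
            b = max(rng.sample(ranked, 3), key=fitness)
            # Uniform crossover followed by Gaussian mutation, kept positive.
            child = [x if rng.random() < 0.5 else y for x, y in zip(a, b)]
            child = [max(0.01, w + rng.gauss(0, 0.1)) for w in child]
            next_gen.append(child)
        population = next_gen
    best = max(population, key=fitness)
    s = sum(best)
    return [w / s for w in best]  # normalize so the weights sum to 1


# Toy example: three hypothetical classifiers scored on six samples.
labels = [1, 0, 1, 0, 1, 0]
probs = [
    [0.9, 0.1, 0.8, 0.2, 0.7, 0.3],  # informative classifier
    [0.2, 0.8, 0.6, 0.4, 0.3, 0.9],  # weak classifier
    [0.3, 0.7, 0.4, 0.7, 0.2, 0.6],  # weak classifier
]
weights = evolve_weights(probs, labels)
print("weights:", [round(w, 3) for w in weights])
print("equal-weight accuracy:", accuracy(weighted_vote(probs, [1, 1, 1]), labels))
print("evolved-weight accuracy:", accuracy(weighted_vote(probs, weights), labels))
```

Because the initial population includes the equal-weight vector and the best candidates are carried forward each generation, the evolved weights should score at least as well as plain averaging on the data used for fitness. In practice the fitness would be computed on held-out validation data to avoid overfitting the weights.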