Abstract:Early diagnosis of heart disease is crucial, as it's one of the leading causes of death globally. Machine learning algorithms can be a powerful tool in achieving this goal. Therefore, this article aims to increase the accuracy of predicting heart disease using machine learning algorithms. Five classification models are explored: eXtreme Gradient Boosting (XGBC), Random Forest Classifier (RFC), Decision Tree Classifier (DTC), K-Nearest Neighbors Classifier (KNNC), and Logistic Regression Classifier (LRC). Additionally, four optimizers are evaluated: Slime mold Optimization Algorithm, Forest Optimization Algorithm, Pathfinder algorithm, and Giant Armadillo Optimization. To ensure robust model selection, a feature selection technique utilizing k-fold cross-validation is employed. This method identifies the most relevant features from the data, potentially improving model performance. The top three performing models are then coupled with the optimization algorithms to potentially enhance their generalizability and accuracy in predicting heart failure. In the final stage, the shortlisted models (XGBC, RFC, and DTC) were assessed using performance metrics like accuracy, precision, recall, F1-score, and Matthews Correlation Coefficient (MCC). This rigorous evaluation identified the XGGA hybrid model as the top performer, demonstrating its effectiveness in predicting heart failure. XGGA achieved impressive metrics, with an accuracy, precision, recall, and F1-score of 0.972 in the training phase, underscoring its robustness. Notably, the model's predictions deviated by less than 5.5 % for patients classified as alive and by less than 1.2 % for those classified as deceased compared to the actual outcomes, reflecting minimal error and high predictive reliability. In contrast, the DTC base model was the least effective, with an accuracy of 0.840 and a precision of 0.847. Overall, the optimization using the GAO algorithm significantly enhanced the performance of the models, highlighting the benefits of this approach.

Decision Combination Based on the Characterisation of Predictive Accuracy.

Optimal Linear Combination Using Decision Reliability Of Individual Classifiers

An Algorithmic Framework for Constructing Multiple Decision Trees by Evaluating Their Combination Performance Throughout the Construction Process

Reconciling Model Multiplicity for Downstream Decision Making

Combined Classifier for Cross-Project Defect Prediction: an Extended Empirical Study.

Token-based Decision Criteria Are Suboptimal in In-context Learning

Learning Human-Compatible Representations for Case-Based Decision Support

A hybrid machine learning framework for analyzing human decision making through learning preferences

Decision Forest: Combining the Predictions of Multiple Independent Decision Tree Models

Combining Predictions of Auto Insurance Claims

Combination prediction method of students' performance based on ant colony algorithm

Learning from Batched Data: Model Combination Versus Data Combination

To combine or not to combine: selecting among forecasts and their combinations

Conditional probability generation methods for high reliability effects-based decision making

Refining heart disease prediction accuracy using hybrid machine learning techniques with novel metaheuristic algorithms

Non-intrusive model combination for learning dynamical systems

Model Combination In The Multiple-Data-Batches Scenario

A Unified Approach to Interpreting Model Predictions

Correcting Predictions for Approximate Bayesian Inference

Hacking a surrogate model approach to XAI

The Conditional Prediction Function: A Novel Technique to Control False Discovery Rate for Complex Models