Abstract:

Purpose: Heart disease is one of the leading causes of death worldwide. Both heart disease and the risk of heart attack have increased in recent years. Medical professionals face significant difficulties when attempting to forecast heart disease. Early prediction is one of the most valuable capabilities in medicine, and this is particularly true in cardiology. Studies on building early-prediction models have highlighted the most up-to-date methods for detecting variations in medical imaging. Computer-assisted diagnosis is a dynamic and quickly developing field. Since wrong medical diagnoses can lead to dangerous treatments, much recent work has gone into improving computer programs that help doctors make diagnoses. Computer-assisted diagnosis relies heavily on machine learning. The essence of pattern recognition is the ability to learn from examples. Pattern recognition and artificial intelligence hold considerable promise for improving the accuracy with which biomedical professionals perceive and diagnose illness, and they also help make decisions more objective. Machine learning is a promising method for developing elegant, automatic algorithms for the study of high-dimensional and multimodal biomedical data. Two heart disease datasets were considered for this research. The study implements several machine learning algorithms and compares their prediction accuracy and a handful of other performance metrics to determine which one is the most effective.

Objective: The primary goal of the research is to evaluate the performance of several machine learning algorithms using different evaluation criteria such as F1 score, ROC curves, and AUC values. The aim is to discover the most effective machine learning algorithm for the datasets obtained for the study.

Design/Methodology/Approach: The research uses heart disease datasets from Kaggle. Python, scikit-learn, Pandas, and Jupyter Notebook were used to build various machine learning prediction models, and the outcomes were compared.

Findings/Results: The two datasets comprise different parameters, so pre-processing had to be customized for each. Applying machine learning algorithms to the training dataset and evaluating the trained models on the testing dataset yielded varied results for each dataset. Model performance was measured by accuracy and AUC. Boosting algorithms gave good results on both datasets; however, the Cleveland dataset did better with decision trees.

Originality/Value: The research examined two Kaggle heart disease datasets. It explored how the data is distributed, how the features depend on each other, and how each feature influences the target feature of heart disease prediction. Models were constructed and trained using different machine learning methods, each with its own set of hyperparameters to tune. To learn which machine learning model is most effective for a given collection of data, the study looked at both the prediction results of the trained models and the performance metrics of the individual models. This study improves our understanding of how different machine learning methods work. To determine the most effective algorithm, additional research on the datasets using deep learning techniques is needed.

Paper Type: Comparative Study
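The evaluation criteria named in the abstract (accuracy, F1 score, and AUC) can be illustrated with a minimal pure-Python sketch. This is not the authors' code, which the abstract says was built with scikit-learn; the function names (`accuracy`, `f1_score`, `roc_auc`, `evaluate`), the 0.5 decision threshold, and the toy labels and scores for two hypothetical models are all assumptions made for illustration, not data or results from the study.

```python
from typing import Dict, List, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class (1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the probability that a random
    positive example is scored above a random negative one (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def evaluate(y_true: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> Dict[str, float]:
    """Threshold the scores into class predictions and report all three metrics."""
    y_pred: List[int] = [1 if s >= threshold else 0 for s in scores]
    return {
        "accuracy": accuracy(y_true, y_pred),
        "f1": f1_score(y_true, y_pred),
        "auc": roc_auc(y_true, scores),
    }


# Toy data (illustrative only): 1 = disease present, 0 = absent.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
# Predicted probabilities from two hypothetical models.
model_scores = {
    "model_a": [0.9, 0.2, 0.8, 0.4, 0.3, 0.6, 0.7, 0.1],
    "model_b": [0.55, 0.45, 0.35, 0.8, 0.2, 0.3, 0.9, 0.6],
}

results = {name: evaluate(y_true, s) for name, s in model_scores.items()}
for name, metrics in results.items():
    print(name, metrics)
```

On this toy data the two hypothetical models tie on accuracy and F1 at the 0.5 threshold but differ in AUC, which is one reason a comparison may report threshold-independent metrics alongside accuracy.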
