AttGRU-HMSI: enhancing heart disease diagnosis using hybrid deep learning approach

G. Madhukar Rao,Dharavath Ramesh,Vandana Sharma,Anurag Sinha,Md. Mehedi Hassan,Amir H. Gandomi
DOI: https://doi.org/10.1038/s41598-024-56931-4
IF: 4.6
2024-04-04
Scientific Reports
Abstract:Heart disease is a major global cause of mortality and a major public health problem for a large number of individuals. A major issue raised by regular clinical data analysis is the recognition of cardiovascular illnesses, including heart attacks and coronary artery disease, even though early identification of heart disease can save many lives. Accurate forecasting and decision assistance may be achieved in an effective manner with machine learning (ML). Big Data, or the vast amounts of data generated by the health sector, may assist models used to make diagnostic choices by revealing hidden information or intricate patterns. This paper uses a hybrid deep learning algorithm to describe a large data analysis and visualization approach for heart disease detection. The proposed approach is intended for use with big data systems, such as Apache Hadoop. An extensive medical data collection is first subjected to an improved k-means clustering (IKC) method to remove outliers, and the remaining class distribution is then balanced using the synthetic minority over-sampling technique (SMOTE). The next step is to forecast the disease using a bio-inspired hybrid mutation-based swarm intelligence (HMSI) with an attention-based gated recurrent unit network (AttGRU) model after recursive feature elimination (RFE) has determined which features are most important. In our implementation, we compare four machine learning algorithms: SAE + ANN (sparse autoencoder + artificial neural network), LR (logistic regression), KNN (K-nearest neighbour), and naïve Bayes. The experiment results indicate that a 95.42% accuracy rate for the hybrid model's suggested heart disease prediction is attained, which effectively outperforms and overcomes the prescribed research gap in mentioned related work.
multidisciplinary sciences
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address several key issues in heart disease diagnosis, particularly in improving the accuracy of heart disease prediction using machine learning techniques in a big data environment. Specifically, the paper addresses the following aspects: 1. **Early Identification of Heart Disease**: - Heart disease is one of the leading causes of death worldwide, and early identification can save many lives. However, traditional diagnostic methods (such as those based on physical lab reports, expert symptom analysis reports, and medical history records) have several shortcomings, including inaccurate diagnostic results, high costs, and computational complexity. 2. **Heart Disease Prediction in a Big Data Environment**: - In medical data processing, heart disease prediction faces challenges from the large amount of available data and various risk factors (such as cholesterol levels, high blood pressure, and abnormal heart rates). Therefore, there is a need to optimize treatment plans and appropriate decision support systems to identify heart risks early. 3. **Improving the Accuracy of Prediction Models**: - By utilizing a hybrid deep learning model, combining an improved K-means clustering (IKC) algorithm to remove outliers, and using Synthetic Minority Over-sampling Technique (SMOTE) to balance class distribution. Additionally, important features are selected through Recursive Feature Elimination (RFE), and disease prediction is performed using an Attention-based Gated Recurrent Unit network (AttGRU), significantly improving the model's prediction accuracy. 4. **Comparison with Existing Models**: - The hybrid model proposed in this paper achieves an accuracy of 95.42% in heart disease prediction, significantly outperforming several existing machine learning algorithms (such as Sparse Autoencoder + Artificial Neural Network, Logistic Regression, K-Nearest Neighbors, and Naive Bayes), effectively addressing issues present in related research. In summary, this paper proposes a heart disease prediction framework that combines big data analysis and hybrid deep learning methods, aiming to improve the accuracy and efficiency of heart disease diagnosis and overcome the limitations of traditional diagnostic methods.