Hypertension Prediction Using Machine Learning Technique

Youngkeun Choi, Jae Choi
DOI: https://doi.org/10.4018/ijsds.2020070103
2020-07-01
International Journal of Strategic Decision Sciences
Abstract: Machine learning technology is used in advanced data analysis and optimization approaches for different kinds of medical problems. Hypertension is a complex condition, and every year it causes many severe illnesses such as stroke and heart disease. This study had two primary goals. First, it aims to better understand the role of variables in hypertension modeling. Second, it seeks to evaluate the predictive performance of decision trees. The results show, first, that age, BMI, and average glucose level influence hypertension significantly, while other variables have an influence. Second, for the full model, the accuracy rate is 0.905, which implies that the error rate is 0.095. Among the patients who were predicted not to have hypertension, 90.51% did not have it, and among the patients who were predicted to have hypertension, 30.77% actually had it.
What problem does this paper attempt to address?
Based on the provided paper abstract and partial content, the problem that this research attempts to solve is to predict hypertension through machine-learning techniques. Specifically, the authors used machine-learning methods such as decision trees to develop a prediction model to help identify patients who may have hypertension.

### Problems the paper attempts to solve

1. **Hypertension prediction**:
   - Hypertension is a common chronic disease, and early detection and intervention are crucial for preventing complications. However, traditional diagnostic methods often rely on regular physical examinations and clinical symptoms, making it difficult to achieve early screening in large-scale populations.
   - This study aims to use machine-learning techniques, especially the decision-tree algorithm, to build an efficient hypertension prediction model in order to identify potential hypertensive patients at an early stage.
2. **Improving prediction accuracy**:
   - The research trains and optimizes the machine-learning model by analyzing a large amount of patient data, including demographic information, lifestyle, and health status, to improve the accuracy and reliability of prediction.
3. **Assisting medical decision-making**:
   - By providing a reliable hypertension prediction tool, doctors can intervene and treat earlier, thereby improving the prognosis of patients and reducing the burden on the medical system.

### Method overview

- **Dataset**: The research used a dataset containing multiple variables, which cover information on patients' personal backgrounds, living habits, health status, etc.
- **Model selection**: The decision tree, especially the ID3 algorithm, was adopted as the main machine-learning model to construct a classification tree.
- **Performance evaluation**: The performance of the model was evaluated through a series of indicators (such as precision, recall, and F-measure) to ensure its effectiveness in practical applications.

### Key formulas

When evaluating the performance of the model, several key performance-indicator formulas were mentioned in the paper:

- **Precision**:
  \[ \text{Precision}=\frac{\text{TP}}{\text{TP}+\text{FP}} \]
- **Recall**:
  \[ \text{Recall}=\frac{\text{TP}}{\text{TP}+\text{FN}} \]
- **F-measure**:
  \[ \text{F-measure}=2\cdot\frac{\text{Precision}\cdot\text{Recall}}{\text{Precision}+\text{Recall}} \]

Among them, TP (True Positive) represents the number of cases correctly predicted as hypertension, FP (False Positive) represents the number of cases wrongly predicted as hypertension, and FN (False Negative) represents the number of cases wrongly predicted as non-hypertension.

In summary, the main objective of this paper is to develop an efficient hypertension prediction model through machine-learning techniques, especially the decision-tree algorithm, to improve the efficiency and accuracy of early screening and intervention.
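
The precision, recall, and F-measure formulas above can be sketched in a few lines of Python. This is only an illustration, not the authors' code: the names `ConfusionCounts` and `count_outcomes`, the 1 = hypertension / 0 = no hypertension label encoding, and the sample labels are all assumptions made for the example, and returning 0.0 when a denominator is zero is a convention chosen here rather than something the paper specifies.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Counts of the four prediction outcomes (hypothetical helper type)."""
    tp: int  # correctly predicted as hypertension
    fp: int  # wrongly predicted as hypertension
    fn: int  # wrongly predicted as non-hypertension
    tn: int  # correctly predicted as non-hypertension


def count_outcomes(y_true, y_pred):
    """Tally TP, FP, FN, TN, assuming labels are 1 (hypertension) or 0."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == 1 and actual == 1:
            tp += 1
        elif predicted == 1 and actual == 0:
            fp += 1
        elif predicted == 0 and actual == 1:
            fn += 1
        else:
            tn += 1
    return ConfusionCounts(tp, fp, fn, tn)


def precision(c):
    """Precision = TP / (TP + FP); 0.0 if nothing was predicted positive."""
    denom = c.tp + c.fp
    return c.tp / denom if denom else 0.0


def recall(c):
    """Recall = TP / (TP + FN); 0.0 if there are no actual positives."""
    denom = c.tp + c.fn
    return c.tp / denom if denom else 0.0


def f_measure(c):
    """F-measure = 2 * Precision * Recall / (Precision + Recall)."""
    p, r = precision(c), recall(c)
    return 2 * p * r / (p + r) if (p + r) else 0.0


def accuracy(c):
    """Share of all cases that were predicted correctly."""
    total = c.tp + c.fp + c.fn + c.tn
    return (c.tp + c.tn) / total if total else 0.0


if __name__ == "__main__":
    # Made-up labels for illustration only; not data from the paper.
    y_true = [1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
    y_pred = [1, 0, 0, 0, 0, 1, 1, 0, 0, 0]
    counts = count_outcomes(y_true, y_pred)
    print(counts)
    print("precision:", precision(counts))
    print("recall:", recall(counts))
    print("F-measure:", f_measure(counts))
    print("error rate:", 1 - accuracy(counts))
```

The error rate is one minus the accuracy, which is how the abstract's accuracy of 0.905 corresponds to an error rate of 0.095. Under the definition above, the 30.77% figure reported for patients predicted to have hypertension reads as the model's precision.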