Prediction of COPD Using Machine Learning, Clinical Summary Notes, and Vital Signs

Negar Orangi-Fard
2024-09-05
Abstract:Chronic obstructive pulmonary disease (COPD) is a chronic inflammatory lung disease that causes obstructed airflow from the lungs. In the United States, more than 15.7 million Americans have been diagnosed with COPD, with 96% of individuals living with at least one other chronic health condition. It is the 4th leading cause of death in the country. Over 2.2 million patients are admitted to hospitals annually due to COPD exacerbations. Monitoring and predicting patient exacerbations on-time could save their life. This paper presents two different predictive models to predict COPD exacerbation using AI and natural language processing (NLP) approaches. These models use respiration summary notes, symptoms, and vital signs. To train and test these models, data records containing physiologic signals and vital signs time series were used. These records were captured from patient monitors and comprehensive clinical data obtained from hospital medical information systems for tens of thousands of Intensive Care Unit (ICU) patients. We achieved an area under the Receiver operating characteristic (ROC) curve of 0.82 in detection and prediction of COPD exacerbation.
Machine Learning,Computation and Language
What problem does this paper attempt to address?
This paper aims to solve the problem of predicting exacerbations in patients with chronic obstructive pulmonary disease (COPD). Specifically, the paper proposes two different prediction models, using artificial intelligence (AI) and natural language processing (NLP) technologies, combined with respiratory summary notes, symptoms and vital signs data to predict exacerbations in COPD patients. By using a large number of physiological signal and vital signs time - series records of intensive care unit (ICU) patients obtained from the hospital medical information system to train and test these models, it is expected to improve the detection and prediction accuracy of COPD exacerbations, thereby improving the treatment effect of patients and reducing the burden on the medical system. The paper mentions that COPD is the fourth leading cause of death in the United States, and more than 2.2 million patients are hospitalized each year due to COPD exacerbations. Timely monitoring and prediction of patients' exacerbations can save lives. However, in clinical practice, COPD is often under - diagnosed or misdiagnosed, especially in the early stages. Therefore, developing models that can accurately predict COPD exacerbations has important clinical significance. By analyzing and comparing the performance of different machine - learning methods, the paper found that the support vector machine (SVM) performs best in predicting COPD exacerbations, with the area under the receiver operating characteristic (ROC) curve reaching 0.82, showing good predictive performance.