A machine learning model to predict the risk of depression in US adults with obstructive sleep apnea hypopnea syndrome: a cross-sectional study

Enguang Li,Fangzhu Ai,Chunguang Liang
DOI: https://doi.org/10.3389/fpubh.2023.1348803
IF: 5.2
2024-01-08
Frontiers in Public Health
Abstract:Objective Depression is very common and harmful in patients with obstructive sleep apnea hypopnea syndrome (OSAHS). It is necessary to screen OSAHS patients for depression early. However, there are no validated tools to assess the likelihood of depression in patients with OSAHS. This study used data from the National Health and Nutrition Examination Survey (NHANES) database and machine learning (ML) methods to construct a risk prediction model for depression, aiming to predict the probability of depression in the OSAHS population. Relevant features were analyzed and a nomogram was drawn to visually predict and easily estimate the risk of depression according to the best performing model. Study design This is a cross-sectional study. Methods Data from three cycles (2005–2006, 2007–2008, and 2015–2016) were selected from the NHANES database, and 16 influencing factors were screened and included. Three prediction models were established by the logistic regression algorithm, least absolute shrinkage and selection operator (LASSO) algorithm, and random forest algorithm, respectively. The receiver operating characteristic (ROC) area under the curve (AUC), specificity, sensitivity, and decision curve analysis (DCA) were used to assess evaluate and compare the different ML models. Results The logistic regression model had lower sensitivity than the lasso model, while the specificity and AUC area were higher than the random forest and lasso models. Moreover, when the threshold probability range was 0.19–0.25 and 0.45–0.82, the net benefit of the logistic regression model was the largest. The logistic regression model clarified the factors contributing to depression, including gender, general health condition, body mass index (BMI), smoking, OSAHS severity, age, education level, ratio of family income to poverty (PIR), and asthma. Conclusion This study developed three machine learning (ML) models (logistic regression model, lasso model, and random forest model) using the NHANES database to predict depression and identify influencing factors among OSAHS patients. Among them, the logistic regression model was superior to the lasso and random forest models in overall prediction performance. By drawing the nomogram and applying it to the sleep testing center or sleep clinic, sleep technicians and medical staff can quickly and easily identify whether OSAHS patients have depression to carry out the necessary referral and psychological treatment.
public, environmental & occupational health
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the issue of early screening for depression in patients with Obstructive Sleep Apnea-Hypopnea Syndrome (OSAHS). Specifically, the research objectives are: 1. **Constructing a Predictive Model**: Using machine learning methods and data from the National Health and Nutrition Examination Survey (NHANES) database, construct a model that can predict the risk of depression in OSAHS patients. 2. **Identifying Influencing Factors**: Analyze the factors related to the risk of depression in OSAHS patients and establish a visual nomogram based on these factors to facilitate quick assessment of patients' depression risk by healthcare professionals. ### Background - **Prevalence and Harm of Depression**: Depression is a common mental health disorder that severely affects patients' psychological and social functioning, reducing their quality of life. It also imposes significant economic and emotional stress on patients' families. - **Relationship Between OSAHS and Depression**: Multiple studies have shown that OSAHS may increase the risk of depression, and the severity of OSAHS is dose-dependently related to the risk of depression. - **Limitations of Existing Screening Tools**: Currently, there is no depression self-assessment scale specifically for OSAHS patients. Although existing self-assessment scales have certain reliability and validity, they cannot accurately predict the risk of depression in OSAHS patients. ### Research Methods - **Data Source**: Data from the 2005-2006, 2007-2008, and 2015-2016 cycles of the NHANES database were selected. - **Predictive Models**: Predictive models were constructed using Logistic Regression, Least Absolute Shrinkage and Selection Operator (LASSO), and Random Forest algorithms. - **Evaluation Metrics**: The performance of different models was evaluated and compared using the Area Under the Receiver Operating Characteristic Curve (AUC), specificity, sensitivity, and Decision Curve Analysis (DCA). ### Main Results - **Model Performance**: The Logistic Regression model outperformed the LASSO and Random Forest models in overall predictive performance, especially in terms of specificity and AUC. - **Influencing Factors**: Gender, general health status, Body Mass Index (BMI), smoking, severity of OSAHS, age, education level, family income-to-poverty ratio (PIR), and asthma are the main factors influencing the risk of depression in OSAHS patients. - **Nomogram**: A nomogram was created to facilitate quick assessment of depression risk in OSAHS patients by healthcare professionals. ### Conclusion - **Research Significance**: By constructing machine learning models, the risk of depression in OSAHS patients can be more accurately predicted, aiding in early screening and intervention, thereby reducing the impact of depression on patients' quality of life and socio-economic status. - **Practical Application**: The nomogram can be used in sleep detection centers or sleep clinics to help healthcare professionals quickly identify the risk of depression in OSAHS patients and provide necessary referrals and psychological treatment.