Machine learning model for depression based on heavy metals among aging people: A study with National Health and Nutrition Examination Survey 2017–2018

Fang Xia,Qingwen Li,Xin Luo,Jinyi Wu
DOI: https://doi.org/10.3389/fpubh.2022.939758
IF: 5.2
2022-08-04
Frontiers in Public Health
Abstract:Objective To explore the association between depression and blood metal elements, we conducted this machine learning model fitting research. Methods Datasets from the National Health and Nutrition Examination Survey (NHANES) in 2017–2018 were downloaded ( https://www.cdc.gov/nchs/nhanes ). After screening, 3,247 aging samples with 10 different metals [lead (Pb), mercury (Hg), cadmium (Cd), manganese (Mn), selenium (Se), chromium (Cr), cobalt (Co), inorganic mercury (InHg), methylmercury (MeHg) and ethyl mercury (EtHg)] were included. Eight machine learning algorithms were compared for analyzing metal and depression. After comparison, XGBoost showed optimal effects. Poisson regression and XGBoost model (a kind of decision tree algorithm) were conducted to find the risk factors and prediction for depression. Results A total of 344 individuals out of 3247 participants were diagnosed with depression. In the Poisson model, we found Cd (β = 0.22, P = 0.00000941), EtHg (β = 3.43, P = 0.003216), and Hg (β=-0.15, P = 0.001524) were related with depression. XGBoost model was the suitable algorithm for the evaluation of depression, the accuracy was 0.89 with 95%CI (0.87, 0.92) and Kappa value was 0.006. Area under the curve (AUC) was 0.88. After that, an online XGBoost application for depression prediction was developed. Conclusion Blood heavy metals, especially Cd, EtHg, and Hg were significantly associated with depression and the prediction of depression was imperative.
public, environmental & occupational health
What problem does this paper attempt to address?