Using machine learning to predict obesity in high school students

Zeyu Zheng, Karen Ruggiero
DOI: https://doi.org/10.1109/bibm.2017.8217988
2017-11-01
Abstract: Four enhanced machine learning models were used to predict obesity in high school students by focusing on both risk and protective factors: binary logistic regression; improved decision tree (IDT); weighted k-nearest neighbor (KNN); and artificial neural network (ANN). Nine health-related behaviors from the 2015 Youth Risk Behavior Surveillance System (YRBSS) for the state of Tennessee were used as model inputs. Results show that, compared with the logistic regression model, which achieved 56.02% accuracy and 54.77% specificity, IDT, weighted KNN, and ANN each performed significantly better. The IDT model achieved 80.23% accuracy and 90.74% specificity, while the weighted KNN model achieved 88.82% accuracy and 93.44% specificity. The ANN model achieved 84.22% accuracy and 99.46% specificity. Implications and suggestions for slowing the increase in adolescent obesity are discussed.
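For readers unfamiliar with the two metrics reported above, the following minimal sketch shows how accuracy and specificity are conventionally computed from confusion-matrix counts. It assumes that "obese" is treated as the positive class, which the abstract does not state, and the counts used are purely hypothetical and are not taken from the paper.

```python
def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def specificity(tn: int, fp: int) -> float:
    """True negative rate: fraction of actual negatives predicted as negative."""
    return tn / (tn + fp)


# Hypothetical counts for illustration only (not from the paper).
# Assumption: positive = obese, negative = not obese.
tp, tn, fp, fn = 30, 150, 10, 10

print(f"accuracy    = {accuracy(tp, tn, fp, fn):.2%}")
print(f"specificity = {specificity(tn, fp):.2%}")
```

Under that assumption, a high specificity means that few non-obese students are incorrectly flagged as obese. It says nothing about how many obese students are missed, which would be measured by sensitivity.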