Predicting time to graduation at a large enrollment American university

John M. Aiken,Riccardo De Bin,Morten Hjorth-Jensen,Marcos D. Caballero
DOI: https://doi.org/10.1371/journal.pone.0242334
IF: 3.7
2020-11-13
PLoS ONE
Abstract:The time it takes a student to graduate with a university degree is mitigated by a variety of factors such as their background, the academic performance at university, and their integration into the social communities of the university they attend. Different universities have different populations, student services, instruction styles, and degree programs, however, they all collect institutional data. This study presents data for 160,933 students attending a large American research university. The data includes performance, enrollment, demographics, and preparation features. Discrete time hazard models for the time-to-graduation are presented in the context of Tinto’s Theory of Drop Out. Additionally, a novel machine learning method: gradient boosted trees, is applied and compared to the typical maximum likelihood method. We demonstrate that enrollment factors (such as changing a major) lead to greater increases in model predictive performance of when a student graduates than performance factors (such as grades) or preparation (such as high school GPA).
multidisciplinary sciences
What problem does this paper attempt to address?