Predictive models of miscarriage based on data from a preconception cohort study
Jennifer J. Yland,Zahra Zad,Tanran R. Wang,Amelia K. Wesselink,Tammy Jiang,Elizabeth E. Hatch,Ioannis Ch. Paschalidis,Lauren A. Wise
DOI: https://doi.org/10.1016/j.fertnstert.2024.04.007
IF: 6.7
2024-04-11
Fertility and Sterility
Abstract:Structured Objective To use self-reported preconception data to derive models that predict risk of miscarriage. Design Prospective preconception cohort study. Subjects Study participants were female, aged 21-45 years, residents of the United States or Canada, and attempting spontaneous pregnancy at enrollment during 2013-2022. Participants were followed for up to 12 months of pregnancy attempts; those who conceived were followed through pregnancy and postpartum. We restricted analyses to participants who conceived during the study period. Exposure On baseline and follow-up questionnaires completed every 8 weeks until pregnancy, we collected self-reported data on sociodemographic factors, reproductive history, lifestyle, anthropometrics, diet, medical history, and male partner characteristics. We included 160 potential predictor variables in our models. Main Outcome Measures The primary outcome was miscarriage, defined as pregnancy loss before 20 weeks' gestation. We followed participants from their first positive pregnancy test until miscarriage or a censoring event (induced abortion, ectopic pregnancy, loss to follow-up, or 20 weeks' gestation), whichever occurred first. We fit both survival and static models, using Cox proportional hazards models, logistic regression, support vector machines, Gradient Boosted Trees, and Random Forest algorithms. We evaluated model performance using the concordance index (survival models) and the weighted-F1 score (static models). Results Among 8,720 participants who conceived, 20.4% reported miscarriage. In multivariable models, the strongest predictors of miscarriage were female age, history of miscarriage, and male partner age. The weighted-F1 score ranged from 73-89% for static models and the concordance index ranged from 53-56% for survival models, indicating better discrimination for the static models compared with the survival models ( i.e. , ability of the model to discriminate between individuals with and without miscarriage). No appreciable differences were observed across strata of miscarriage history or among models restricted to ≥8 weeks' gestation. Conclusion Our findings suggest that miscarriage is not easily predicted based on preconception lifestyle characteristics, and that advancing age and history of miscarriage are the most important predictors of incident miscarriage.
obstetrics & gynecology,reproductive biology