Development and Evaluation of Deep Learning Models for Cardiotocography Interpretation

Nicole Chiou,Nichole Young-Lin,Christopher Kelly,Julie Cattiau,Tiya Tiyasirichokchai,Abdoulaye Diack,Sanmi Koyejo,Katherine Heller,Mercy Asiedu
DOI: https://doi.org/10.1101/2024.03.05.24303805
2024-03-06
Abstract:The inherent variability in the visual interpretation of cardiotocograms (CTGs) by obstetric clinical experts, both intra- and inter-observer, presents a substantial challenge in obstetric care. In response, we investigate automated CTG interpretation as a potential solution to enhance the early detection of fetal hypoxia during labor, which has the potential to reduce unnecessary operative interventions and improve overall maternal and neonatal care. This study employs deep learning techniques to reduce the subjectivity associated with visual CTG interpretation. Our results demonstrate that using objective umbilical cord blood pH outcome measurements, rather than clinician-defined Apgar scores, yields more consistent and robust model performance. Additionally, through a series of ablation studies, we explore the impact of temporal distribution shifts on the performance of these deep learning models. We examine tradeoffs between performance and fairness, specifically evaluating performance across demographic and clinical subgroups. Finally, we discuss the practical implications of our findings for the real-world deployment of such systems, emphasizing their potential utility in medical settings with limited resources.
Obstetrics and Gynecology
What problem does this paper attempt to address?