An innovative supervised longitudinal learning procedure of recurrent neural networks with temporal data augmentation: Insights from predicting fetal macrosomia and large-for-gestational age

Rongjie Liu, Yuanxin Yao, Cancan Zhang, Bo Zhang
DOI: https://doi.org/10.1016/j.compbiomed.2024.108665
IF: 7.7
2024-05-30
Computers in Biology and Medicine
Abstract: Background: Longitudinal data in health informatics studies often present challenges because each subject contributes only sparse observations, which limits the application of contemporary deep learning for prediction. This issue is particularly relevant to predicting birthweight, a crucial factor in identifying conditions such as macrosomia and large-for-gestational age (LGA). Previous approaches relied on empirical formulas that compute estimated fetal weights (EFWs) from ultrasound measurements and on mixed-effects models for interim predictions. Method: The proposed supervised longitudinal learning procedure has three steps. First, EFWs were generated from ultrasound measurements using empirical formulas. Second, nonlinear mixed-effects models were used to create augmented sequences of EFWs at daily gestational timepoints. This augmentation transformed the sparse longitudinal data into dense parallel sequences suitable for training recurrent neural networks (RNNs). Third, a tailored RNN architecture was devised to combine the augmented sequential EFWs with non-sequential maternal characteristics. Results: The RNN algorithms were trained on the augmented data to predict birthweights, which were then classified for macrosomia and LGA. Applying this procedure to the Successive Small-for-Gestational-Age Births study improved classification performance. Specifically, sensitivity, area under the receiver operating characteristic curve, and Youden's Index all improved, indicating that the proposed approach is effective in overcoming sparsity in longitudinal health informatics data. Conclusions: Combining mixed-effects models for temporal data augmentation with RNNs trained on the augmented sequences proved effective in accurately predicting birthweights, particularly for identifying excessive fetal growth conditions.
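The augmentation step described in the abstract, which turns a few EFW observations per subject into a daily sequence, can be illustrated with a minimal sketch. This is not the authors' nonlinear mixed-effects model: as a simplified stand-in, it assumes a pooled quadratic trend in log EFW plus a per-subject offset shrunk toward zero. The function name `augment_daily`, the `shrinkage` factor, and the toy measurements are all hypothetical and chosen only for illustration.

```python
import numpy as np

def augment_daily(ga_days, log_efw, subject_ids, grid, shrinkage=0.8):
    """Expand sparse (gestational age, log EFW) points into dense daily EFW sequences.

    Simplified stand-in for a mixed-effects model (an assumption, not the
    paper's method): pooled quadratic trend plus a shrunken subject offset.
    """
    ga_days = np.asarray(ga_days, dtype=float)
    log_efw = np.asarray(log_efw, dtype=float)
    subject_ids = np.asarray(subject_ids)

    # Population-level trend: quadratic in gestational age, fitted on pooled data.
    coefs = np.polyfit(ga_days, log_efw, deg=2)
    residuals = log_efw - np.polyval(coefs, ga_days)

    sequences = {}
    for sid in np.unique(subject_ids):
        mask = subject_ids == sid
        # Subject-level offset: mean residual, shrunk toward zero by a
        # hypothetical fixed factor (a real model would estimate this).
        offset = shrinkage * residuals[mask].mean()
        # Evaluate the subject's curve on the daily grid, back on the gram scale.
        sequences[sid.item()] = np.exp(np.polyval(coefs, grid) + offset)
    return sequences

# Toy data: two subjects, three ultrasound visits each (values are made up).
ga = [140, 196, 238, 147, 203, 245]          # gestational age in days
efw = [330.0, 1200.0, 2400.0, 400.0, 1400.0, 2800.0]  # EFW in grams
ids = [1, 1, 1, 2, 2, 2]
grid = np.arange(140, 281)                    # daily timepoints, day 140 to 280

dense = augment_daily(ga, np.log(efw), ids, grid)
print({sid: seq.shape for sid, seq in dense.items()})
```

Every subject ends up with a sequence of the same length on the same daily grid, which is what makes the output usable as parallel input to a recurrent network; the non-sequential maternal characteristics would be supplied separately.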
computer science, interdisciplinary applications; engineering, biomedical; biology; mathematical & computational biology
What problem does this paper attempt to address?