Predicting Pediatric Genetic Epilepsy Through Electronic Medical Records: A Data-Driven Biomarker Discovery Approach

Yi Li
DOI: https://doi.org/10.1177/15357597241290322
2024-11-07
Epilepsy Currents
Abstract:Epilepsy Currents, Ahead of Print. Clinical Signatures of Genetic Epilepsies Precede Diagnosis in Electronic Medical Records of 32 000 IndividualsGaler PD, Parthasarathy S, Xian J, McKee JL, Ruggiero SM, Ganesan S, Kaufman MC, Cohen SR, Haag S, Chen C, Ojemann WKS, Kim D, Wilmarth O, Vaidiswaran P, Sederman C, Ellis CA, Gonzalez AK, Boßelmann CM, Lal D, Sederman R, Lewis-Smith D, Litt B, Helbig I. Genet Med. 2024101211. doi:10.1016/j.gim.2024.101211. PMID: 39011766Purpose: An early genetic diagnosis can guide the time-sensitive treatment of individuals with genetic epilepsies. However, most genetic diagnoses occur long after disease onset. We aimed to identify early clinical features suggestive of genetic diagnoses in individuals with epilepsy through large-scale analysis of full-text electronic medical records (EMRs). Methods: We extracted 89 million time-stamped standardized clinical annotations using Natural Language Processing from 4,572,783 clinical notes from 32 112 individuals with childhood epilepsy, including 1925 individuals with known or presumed genetic epilepsies. We applied these features to train random forest models to predict SCN1A-related disorders and any genetic diagnosis. Results: We identified 47 774 age-dependent associations of clinical features with genetic etiologies a median of 3.6 years prior to molecular diagnosis. Across all 710 genetic etiologies identified in our cohort, neurodevelopmental differences between 6 and 9 months increased the likelihood of a later molecular diagnosis fivefold (P < .0001, 95% CI = 3.55-7.42). A later diagnosis of SCN1A-related disorders (AUC = 0.91) or an overall positive genetic diagnosis (AUC = 0.82) could be reliably predicted using random forest models. Conclusion: Clinical features predictive of genetic epilepsies precede molecular diagnoses by up to several years in conditions with known precision treatments. An earlier diagnosis facilitated by automated EMR analysis has the potential for earlier targeted therapeutic strategies in the genetic epilepsies.
clinical neurology
What problem does this paper attempt to address?