Clinical signatures of genetic epilepsies precede diagnosis in electronic medical records of 32,000 individuals
Peter D Galer,Shridhar Parthasarathy,Julie Xian,Jillian L McKee,Sarah M Ruggiero,Shiva Ganesan,Michael C Kaufman,Stacey R Cohen,Scott Haag,Chen Chen,William K S Ojemann,Dan Kim,Olivia Wilmarth,Priya Vaidiswaran,Casey Sederman,Colin A Ellis,Alexander K Gonzalez,Christian M Boßelmann,Dennis Lal,Rob Sederman,David Lewis-Smith,Brian Litt,Ingo Helbig
DOI: https://doi.org/10.1016/j.gim.2024.101211
2024-07-13
Abstract:Purpose: An early genetic diagnosis can guide the time-sensitive treatment of individuals with genetic epilepsies. However, most genetic diagnoses occur long after disease onset. We aimed to identify early clinical features suggestive of genetic diagnoses in individuals with epilepsy through large-scale analysis of full-text electronic medical records (EMR). Methods: We extracted 89 million time-stamped standardized clinical annotations using Natural Language Processing from 4,572,783 clinical notes from 32,112 individuals with childhood epilepsy, including 1,925 individuals with known or presumed genetic epilepsies. We applied these features to train random forest models to predict SCN1A-related disorders and any genetic diagnosis. Results: We identified 47,774 age-dependent associations of clinical features with genetic etiologies a median of 3.6 years prior to molecular diagnosis. Across all 710 genetic etiologies identified in our cohort, neurodevelopmental differences between 6-9 months increased the likelihood of a later molecular diagnosis fivefold (P<0.0001, 95% CI=3.55-7.42). A later diagnosis of SCN1A-related disorders (AUC=0.91) or an overall positive genetic diagnosis (AUC=0.82) could be reliably predicted using random forest models. Conclusion: Clinical features predictive of genetic epilepsies precede molecular diagnoses by up to several years in conditions with known precision treatments. An earlier diagnosis facilitated by automated EMR analysis has the potential for earlier targeted therapeutic strategies in the genetic epilepsies.