Multisource representation learning for pediatric knowledge extraction from electronic health records

Mengyan Li,Xiaoou Li,Kevin Pan,Alon Geva,Doris Yang,Sara Morini Sweet,Clara-Lea Bonzel,Vidul Ayakulangara Panickan,Xin Xiong,Kenneth Mandl,Tianxi Cai
DOI: https://doi.org/10.1038/s41746-024-01320-4
IF: 15.2
2024-11-17
npj Digital Medicine
Abstract:Electronic Health Record (EHR) systems are particularly valuable in pediatrics due to high barriers in clinical studies, but pediatric EHR data often suffer from low content density. Existing EHR code embeddings tailored for the general patient population fail to address the unique needs of pediatric patients. To bridge this gap, we introduce a transfer learning approach, MU ltisource G raph S ynthesis (MUGS), aimed at accurate knowledge extraction and relation detection in pediatric contexts. MUGS integrates graphical data from both pediatric and general EHR systems, along with hierarchical medical ontologies, to create embeddings that adaptively capture both the homogeneity and heterogeneity between hospital systems. These embeddings enable refined EHR feature engineering and nuanced patient profiling, proving particularly effective in identifying pediatric patients similar to specific profiles, with a focus on pulmonary hypertension (PH). MUGS embeddings, resistant to negative transfer, outperform other benchmark methods in multiple applications, advancing evidence-based pediatric research.
health care sciences & services,medical informatics
What problem does this paper attempt to address?