Disease Correlation Network: a Computational Package for Identifying Temporal Correlations Between Disease States from Large-Scale Longitudinal Medical Records.

Huaiying Lin, Ruichen Rong, Xiang Gao, Kashi Revanna, Michael Zhao, Petar Bajic, David Jin, Chengjun Hu, Qunfeng Dong
DOI: https://doi.org/10.1093/jamiaopen/ooz031
2019-01-01
JAMIA Open
Abstract:
OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR).
MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched cohorts from longitudinal medical records to assess for significant temporal correlations between diseases using two independent methodologies: Cox proportional hazards regression and random forest survival analysis. This optimizable package has the potential to control for relevant confounding factors such as age, gender, and other demographic and medical characteristics. Output is presented as a DCN which may be analyzed using a JavaScript-based interactive visualization tool for users to explore statistically significant correlations between disease states of interest using graph-theory-based network topology.
RESULTS: We have applied this package to a longitudinal dataset at Loyola University Chicago Medical Center with 654 084 distinct initial diagnoses of 51 conditions in 175 539 patients. Over 90% of disease correlations identified are supported by literature review. DCN is available for download at https://github.com/qunfengdong/DCN.
CONCLUSIONS: DCN allows screening of EMR data to identify potential relationships between chronic disease states. This data may then be used to formulate novel research hypotheses for further characterization of these relationships.
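The retrospective matched-cohort step described in the abstract can be illustrated with a minimal Python sketch. This is not DCN's code or input format (DCN itself is an R package): the record layout, the function names `follow_up` and `build_matched_cohort`, the one-to-one matching on gender and birth year, and the fixed censoring date are all assumptions made for illustration. Each exposed patient is paired with one unexposed patient, and both are followed from the exposed patient's first diagnosis date until the outcome disease appears or the study ends.

```python
from datetime import date

# Hypothetical toy records; field names and values are illustrative only.
PATIENTS = {
    "p1": {"birth_year": 1950, "gender": "F",
           "dx": {"diabetes": date(2010, 1, 1), "ckd": date(2013, 1, 1)}},
    "p2": {"birth_year": 1951, "gender": "F", "dx": {"ckd": date(2016, 1, 1)}},
    "p3": {"birth_year": 1970, "gender": "M", "dx": {"diabetes": date(2012, 6, 1)}},
    "p4": {"birth_year": 1971, "gender": "M", "dx": {}},
    "p5": {"birth_year": 1990, "gender": "F", "dx": {}},
}
STUDY_END = date(2018, 1, 1)  # assumed administrative censoring date


def follow_up(patient, outcome, index_date, study_end):
    """Return (days_at_risk, event_observed), counted from index_date."""
    outcome_date = patient["dx"].get(outcome)
    if outcome_date is not None and index_date < outcome_date <= study_end:
        return (outcome_date - index_date).days, True
    return (study_end - index_date).days, False  # censored at study end


def build_matched_cohort(patients, exposure, outcome, study_end, max_age_gap=2):
    """Greedy 1:1 matching of exposed to unexposed patients.

    Returns rows of (patient_id, exposed, days_at_risk, event_observed).
    """
    used_controls = set()
    rows = []
    for pid in sorted(patients):
        case = patients[pid]
        index_date = case["dx"].get(exposure)
        if index_date is None:
            continue  # never diagnosed with the exposure disease
        prior = case["dx"].get(outcome)
        if prior is not None and prior <= index_date:
            continue  # outcome did not follow the exposure in time
        for cid in sorted(patients):
            ctrl = patients[cid]
            if cid in used_controls or exposure in ctrl["dx"]:
                continue
            ctrl_outcome = ctrl["dx"].get(outcome)
            if ctrl_outcome is not None and ctrl_outcome <= index_date:
                continue  # control must be outcome-free at the index date
            if ctrl["gender"] != case["gender"]:
                continue
            if abs(ctrl["birth_year"] - case["birth_year"]) > max_age_gap:
                continue
            used_controls.add(cid)
            rows.append((pid, True) + follow_up(case, outcome, index_date, study_end))
            rows.append((cid, False) + follow_up(ctrl, outcome, index_date, study_end))
            break  # one control per exposed patient
    return rows


cohort = build_matched_cohort(PATIENTS, "diabetes", "ckd", STUDY_END)
for row in cohort:
    print(row)
```

Rows of this shape (exposure flag, time at risk, event indicator, plus any covariates) are the usual input to a Cox proportional hazards model or a random survival forest; the significant exposure-to-outcome pairs would then become directed edges of the network.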