A framework for sharing of clinical and genetic data for precision medicine applications
Ahmed Elhussein, Ulugbek Baymuradov, NYGC ALS Consortium, Noémie Elhadad, Karthik Natarajan, Gamze Gürsoy, Hemali Phatnani, Justin Kwan, Dhruv Sareen, James R Broach, Zachary Simmons, Ximena Arcila-Londono, Edward B Lee, Vivianna M Van Deerlin, Neil A Shneider, Ernest Fraenkel, Lyle W Ostrow, Frank Baas, Noah Zaitlen, James D Berry, Andrea Malaspina, Pietro Fratta, Gregory A Cox, Leslie M Thompson, Steve Finkbeiner, Efthimios Dardiotis, Timothy M Miller, Siddharthan Chandran, Suvankar Pal, Eran Hornstein, Daniel J MacGowan, Terry Heiman-Patterson, Molly G Hammell, Nikolaos A Patsopoulos, Joshua Dubnau, Avindra Nath, Robert Bowser, Matt Harms, Eleonora Aronica, Mary Poss, Jennifer Phillips-Cremins, John Crary, Nazem Atassi, Dale J Lange, Darius J Adams, Leonidas Stefanis, Marc Gotkine, Robert H Baloh, Suma Babu, Towfique Raj, Sabrina Paganoni, Ophir Shalem, Colin Smith, Bin Zhang, Brent Harris, Iris Broce, Vivian Drory, John Ravits, Corey McMillan, Vilas Menon, Lani Wu, Steven Altschuler, Yossef Lerner, Rita Sattler, Kendall Van Keuren-Jensen, Orit Rozenblatt-Rosen, Kerstin Lindblad-Toh, Katharine Nicholson, Peter Gregersen
DOI: https://doi.org/10.1038/s41591-024-03239-5
2024-09-03
Abstract: Precision medicine has the potential to provide more accurate diagnosis, appropriate treatment and timely prevention strategies by considering patients' biological makeup. However, this cannot be realized without integrating clinical and omics data in a data-sharing framework that achieves large sample sizes. Systems that integrate clinical and genetic data from multiple sources are scarce because the two kinds of data have distinct types and raise interoperability, security and data ownership issues. Here we present a secure framework that allows immutable storage, querying and analysis of clinical and genetic data using blockchain technology. Our platform harmonizes clinical and genetic data by combining them under a unified framework. It supports combined genotype-phenotype queries and analysis, gives institutions control of their data and provides immutable user access logs, improving transparency into how and when health information is used. We demonstrate the value of our framework for precision medicine by creating genotype-phenotype cohorts and examining relationships within them. We show that combining data across institutions using our secure platform increases statistical power for rare disease analysis. By offering an integrated, secure and decentralized framework, we aim to enhance reproducibility and encourage broader participation from communities and patients in data sharing.
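The idea of an immutable access log mentioned in the abstract can be illustrated with a hash chain, the basic structure underlying blockchains. The sketch below is not the authors' implementation: the class `AccessLog`, its methods, and the record fields `user` and `query` are hypothetical names chosen for illustration, and a real deployment would also replicate the chain across institutions rather than keep it in one process.

```python
import hashlib
import json

# Placeholder "previous hash" for the first entry (an assumption of this sketch).
GENESIS_HASH = "0" * 64


def _entry_hash(prev_hash, record):
    """Hash a record together with the hash of the entry before it."""
    payload = json.dumps({"prev": prev_hash, "record": record}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class AccessLog:
    """Append-only log in which each entry commits to its predecessor."""

    def __init__(self):
        self.entries = []

    def append(self, user, query):
        # Link the new entry to the hash of the most recent one.
        prev_hash = self.entries[-1]["hash"] if self.entries else GENESIS_HASH
        record = {"user": user, "query": query}
        self.entries.append(
            {"prev": prev_hash, "record": record, "hash": _entry_hash(prev_hash, record)}
        )

    def verify(self):
        # Recompute every hash; any edit to an earlier entry breaks the chain.
        prev_hash = GENESIS_HASH
        for entry in self.entries:
            expected = _entry_hash(prev_hash, entry["record"])
            if entry["prev"] != prev_hash or entry["hash"] != expected:
                return False
            prev_hash = entry["hash"]
        return True
```

In this sketch, `verify()` returns `True` for an untouched log and `False` once any stored record is altered, which is the property that makes after-the-fact changes to access history detectable.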