Collaborative learning from distributed data with differentially private synthetic data

Lukas Prediger,Joonas Jälkö,Antti Honkela,Samuel Kaski
DOI: https://doi.org/10.1186/s12911-024-02563-7
IF: 3.298
2024-06-17
BMC Medical Informatics and Decision Making
Abstract:Consider a setting where multiple parties holding sensitive data aim to collaboratively learn population level statistics, but pooling the sensitive data sets is not possible due to privacy concerns and parties are unable to engage in centrally coordinated joint computation. We study the feasibility of combining privacy preserving synthetic data sets in place of the original data for collaborative learning on real-world health data from the UK Biobank.
medical informatics
What problem does this paper attempt to address?