COLLAGENE enables privacy-aware federated and collaborative genomic data analysis

Wentao Li,Miran Kim,Kai Zhang,Han Chen,Xiaoqian Jiang,Arif Harmanci
DOI: https://doi.org/10.1186/s13059-023-03039-z
IF: 17.906
2023-09-13
Genome Biology
Abstract:Growing regulatory requirements set barriers around genetic data sharing and collaborations. Moreover, existing privacy-aware paradigms are challenging to deploy in collaborative settings. We present COLLAGENE, a tool base for building secure collaborative genomic data analysis methods. COLLAGENE protects data using shared-key homomorphic encryption and combines encryption with multiparty strategies for efficient privacy-aware collaborative method development. COLLAGENE provides ready-to-run tools for encryption/decryption, matrix processing, and network transfers, which can be immediately integrated into existing pipelines. We demonstrate the usage of COLLAGENE by building a practical federated GWAS protocol for binary phenotypes and a secure meta-analysis protocol. COLLAGENE is available at https://zenodo.org/record/8125935.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?