FedRBE -- a decentralized privacy-preserving federated batch effect correction tool for omics data based on limma

Yuliya Burankova,Julian Klemm,Jens J. G. Lohmann,Ahmad Taheri,Niklas Probul,Jan Baumbach,Olga Zolotareva
2024-12-08
Abstract:Batch effects in omics data obscure true biological signals and constitute a major challenge for privacy-preserving analyses of distributed patient data. Existing batch effect correction methods either require data centralization, which may easily conflict with privacy requirements, or lack support for missing values and automated workflows. To bridge this gap, we developed fedRBE, a federated implementation of limma's removeBatchEffect method. We implemented it as an app for the FeatureCloud platform. Unlike its existing analogs, fedRBE effectively handles data with missing values and offers an automated, user-friendly online user interface (<a class="link-external link-https" href="https://featurecloud.ai/app/fedrbe" rel="external noopener nofollow">this https URL</a>). Leveraging secure multi-party computation provides enhanced security guarantees over classical federated learning approaches. We evaluated our fedRBE algorithm on simulated and real omics data, achieving performance comparable to the centralized method with negligible differences (no greater than 3.6E-13). By enabling collaborative correction without data sharing, fedRBE facilitates large-scale omics studies where batch effect correction is crucial.
Quantitative Methods,Cryptography and Security,Distributed, Parallel, and Cluster Computing,Machine Learning
What problem does this paper attempt to address?