Abstract:Generalizability, external validity, and reproducibility are high priorities for artificial intelligence applications in healthcare. Traditional approaches to addressing these elements involve sharing patient data between institutions or practice settings, which can compromise data privacy (individuals' right to prevent the sharing and disclosure of information about themselves) and data security (simultaneously preserving confidentiality, accuracy, fidelity, and availability of data). This article describes insights from real-world implementation of federated learning techniques that offer opportunities to maintain both data privacy and availability via collaborative machine learning that shares knowledge, not data. Local models are trained separately on local data. As they train, they send local model updates (e.g. coefficients or gradients) for consolidation into a global model. In some use cases, global models outperform local models on new, previously unseen local datasets, suggesting that collaborative learning from a greater number of examples, including a greater number of rare cases, may improve predictive performance. Even when sharing model updates rather than data, privacy leakage can occur when adversaries perform property or membership inference attacks which can be used to ascertain information about the training set. Emerging techniques mitigate risk from adversarial attacks, allowing investigators to maintain both data privacy and availability in collaborative healthcare research. When data heterogeneity between participating centers is high, personalized algorithms may offer greater generalizability by improving performance on data from centers with proportionately smaller training sample sizes. Properly applied, federated learning has the potential to optimize the reproducibility and performance of collaborative learning while preserving data security and privacy.

Knowledge abstraction and filtering based federated learning over heterogeneous data views in healthcare

A Federated Learning Framework Via Decentralized Data Valuation for Chronic Disease Healthcare

Privacy preservation for federated learning in health care

Federated learning for preserving data privacy in collaborative healthcare research

Federated Learning for Healthcare: Systematic Review and Architecture Proposal

On the Impact of Data Heterogeneity in Federated Learning Environments with Application to Healthcare Networks

Unified Fair Federated Learning for Digital Healthcare

Towards Fair and Privacy Preserving Federated Learning for the Healthcare Domain

Federated Learning for Data and Model Heterogeneity in Medical Imaging

Federated learning: Applications, challenges and future directions

Robust Federated Learning for Heterogeneous Model and Data

Privacy-Preserving Heterogeneous Federated Learning for Sensitive Healthcare Data

Federated Learning in Healthcare: Model Misconducts, Security, Challenges, Applications, and Future Research Directions -- A Systematic Review

The future of digital health with federated learning

A Review of Privacy Enhancement Methods for Federated Learning in Healthcare Systems

Security and Privacy Issues and Solutions in Federated Learning for Digital Healthcare

Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation

Improving Fairness in AI Models on Electronic Health Records: The Case for Federated Learning Methods

Enabling End-to-End Secure Federated Learning in Biomedical Research on Heterogeneous Computing Environments with APPFLx

Federated learning based futuristic biomedical big-data analysis and standardization