Encrypted federated learning for secure decentralized collaboration in cancer image analysis
Daniel Truhn, Soroosh Tayebi Arasteh, Oliver Lester Saldanha, Gustav Müller-Franzes, Firas Khader, Philip Quirke, Nicholas P West, Richard Gray, Gordon G A Hutchins, Jacqueline A James, Maurice B Loughrey, Manuel Salto-Tellez, Hermann Brenner, Alexander Brobeil, Tanwei Yuan, Jenny Chang-Claude, Michael Hoffmeister, Sebastian Foersch, Tianyu Han, Sebastian Keil, Maximilian Schulze-Hagen, Peter Isfort, Philipp Bruners, Georgios Kaissis, Christiane Kuhl, Sven Nebelung, Jakob Nikolas Kather
DOI: https://doi.org/10.1016/j.media.2023.103059
IF: 10.9
2024-02-01
Medical Image Analysis
Abstract: Artificial intelligence (AI) has a multitude of applications in cancer research and oncology. However, the training of AI systems is impeded by the limited availability of large datasets due to data protection requirements and other regulatory obstacles. Federated and swarm learning represent possible solutions to this problem by collaboratively training AI models while avoiding data transfer. However, in these decentralized methods, weight updates are still transferred to the aggregation server for merging the models. This leaves the possibility for a breach of data privacy, for example by model inversion or membership inference attacks by untrusted servers. Somewhat-homomorphically-encrypted federated learning (SHEFL) is a solution to this problem because only encrypted weights are transferred, and model updates are performed in the encrypted space. Here, we demonstrate the first successful implementation of SHEFL in a range of clinically relevant tasks in cancer image analysis on multicentric datasets in radiology and histopathology. We show that SHEFL enables the training of AI models which outperform locally trained models and perform on par with models which are centrally trained. In the future, SHEFL can enable multiple institutions to co-train AI models without forsaking data governance and without ever transmitting any decryptable data to untrusted servers.
engineering, biomedical; computer science, interdisciplinary applications, artificial intelligence; radiology, nuclear medicine & medical imaging
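The abstract's central idea, that the server merges model updates without ever seeing plaintext weights, can be illustrated with a minimal sketch. This is not the scheme used in the paper: as a stand-in it uses a toy Paillier cryptosystem, which is only additively homomorphic, with deliberately small fixed primes. All names (`encrypt`, `decrypt`, `aggregate`, `SCALE`) and the fixed-point encoding are assumptions made for illustration.

```python
import math
import secrets

# Toy Paillier key material (ASSUMPTION: illustration only, far too small
# for real security). P and Q are the Mersenne primes 2^31-1 and 2^61-1.
P = 2**31 - 1
Q = 2**61 - 1
N = P * Q                      # public modulus
N2 = N * N
LAM = (P - 1) * (Q - 1) // math.gcd(P - 1, Q - 1)  # private: lcm(P-1, Q-1)
MU = pow(LAM, -1, N)           # private: inverse of LAM modulo N
SCALE = 10**6                  # fixed-point scale for encoding float weights


def encrypt(weight: float) -> int:
    """Encrypt one weight at a participating site (public key only)."""
    m = round(weight * SCALE) % N          # negatives wrap around modulo N
    while True:
        r = secrets.randbelow(N - 1) + 1   # random blinding factor in [1, N)
        if math.gcd(r, N) == 1:
            break
    g_m = (1 + m * N) % N2                 # (N + 1)^m mod N^2
    return (g_m * pow(r, N, N2)) % N2


def decrypt(ciphertext: int) -> float:
    """Decrypt at a site that holds the private key (never the server)."""
    m = ((pow(ciphertext, LAM, N2) - 1) // N) * MU % N
    if m > N // 2:                         # map back to a signed value
        m -= N
    return m / SCALE


def aggregate(encrypted_updates: list[list[int]]) -> list[int]:
    """Server side: sum the sites' updates without decrypting anything.

    Multiplying Paillier ciphertexts adds the underlying plaintexts.
    """
    summed = list(encrypted_updates[0])
    for update in encrypted_updates[1:]:
        summed = [(a * b) % N2 for a, b in zip(summed, update)]
    return summed


# Three hypothetical sites, each holding a two-parameter model update.
site_updates = [[0.12, -0.50], [0.30, 0.25], [-0.06, 1.00]]
encrypted = [[encrypt(w) for w in update] for update in site_updates]
encrypted_sum = aggregate(encrypted)

# Sites decrypt the sum and divide by the number of sites to get the average.
averaged = [decrypt(c) / len(site_updates) for c in encrypted_sum]
```

In this sketch the server only ever handles the integers in `encrypted` and `encrypted_sum`; averaging is completed by the key holders after decryption, because division is not available in the encrypted space here.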