Split learning for health: Distributed deep learning without sharing raw patient data

Praneeth Vepakomma, Otkrist Gupta, Tristan Swedish, Ramesh Raskar
DOI: https://doi.org/10.48550/arXiv.1812.00564
IF: 5.414
2018-12-03
Machine Learning
Abstract: Can health entities collaboratively train deep learning models without sharing sensitive raw data? This paper proposes several configurations of a distributed deep learning method called SplitNN to facilitate such collaborations. SplitNN does not share raw data or model details with collaborating institutions. The proposed configurations of SplitNN cater to practical settings of i) entities holding different modalities of patient data, ii) centralized and local health entities collaborating on multiple tasks, and iii) learning without sharing labels. We compare the performance and resource-efficiency trade-offs of SplitNN with those of other distributed deep learning methods, such as federated learning and large batch synchronous stochastic gradient descent, and show highly encouraging results for SplitNN.