An Efficient Digital Twin Assisted Clustered Federated Learning Algorithm for Disease Prediction.

Xiaoming Yuan, Jialin Zhang, Jingqi Luo, Jiahui Chen, Zhiguo Shi, Mingwei Qin
DOI: https://doi.org/10.1109/vtc2022-spring54318.2022.9860704
2022-01-01
Abstract: In-depth analysis of medical data with machine learning enables disease prediction, which supports the early detection and treatment of diseases. However, medical data contains a large amount of private patient information, and privacy protection prevents medical institutions from sharing their datasets directly. As a result, medical data often exists as isolated data islands, which makes it difficult for most existing prediction models to perform disease prediction. This paper proposes an efficient digital twin assisted clustered Federated Learning (FL) algorithm for disease prediction that breaks down data islands while preserving privacy. First, we design an efficient clustered Federated Learning with Client Selection (FLCS) protocol, based on heterogeneity and contribution, to improve training efficiency and prediction accuracy. Second, we use digital twins to assist the FLCS protocol in carrying out large-scale prediction. In addition, the Shapley value used to calculate client contribution makes the model interpretable and enhances the reliability of the prediction results. Finally, evaluation results show that, compared with common prediction models and the FedAvg algorithm, the digital twin assisted FLCS protocol achieves better efficiency and accuracy in binary classification prediction.
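The abstract states that the Shapley value is used to calculate client contribution but gives no formula or procedure. The sketch below shows the standard exact Shapley computation applied to a set of clients, followed by a simple top-k selection. It is an illustration under stated assumptions, not the paper's FLCS protocol: the function names (`shapley_values`, `select_top_clients`, `toy_utility`), the client labels, the accuracy table, and the top-k rule are all hypothetical, and the heterogeneity-based clustering and the digital twin component are not modeled.

```python
from itertools import combinations
from math import factorial


def shapley_values(clients, utility):
    """Exact Shapley value of every client for a coalition utility function."""
    n = len(clients)
    values = {}
    for client in clients:
        others = [c for c in clients if c != client]
        total = 0.0
        for size in range(n):  # coalition sizes 0 .. n-1
            # Weight of a coalition S of this size: |S|! (n - |S| - 1)! / n!
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                coalition = frozenset(subset)
                # Marginal gain from adding this client to the coalition.
                marginal = utility(coalition | {client}) - utility(coalition)
                total += weight * marginal
        values[client] = total
    return values


def select_top_clients(values, k):
    """Hypothetical selection rule: keep the k clients with the largest value."""
    return sorted(values, key=values.get, reverse=True)[:k]


# Made-up validation accuracies for each coalition (illustrative numbers only).
TOY_ACCURACY = {
    frozenset(): 0.50,
    frozenset({"A"}): 0.70,
    frozenset({"B"}): 0.60,
    frozenset({"C"}): 0.50,
    frozenset({"A", "B"}): 0.75,
    frozenset({"A", "C"}): 0.70,
    frozenset({"B", "C"}): 0.60,
    frozenset({"A", "B", "C"}): 0.75,
}


def toy_utility(coalition):
    """Hypothetical utility: accuracy of a model trained on the coalition's data."""
    return TOY_ACCURACY[frozenset(coalition)]


contributions = shapley_values(["A", "B", "C"], toy_utility)
selected = select_top_clients(contributions, k=2)
print(contributions, selected)
```

In this toy table, client C never changes the accuracy of any coalition, so its Shapley value is zero, while A and B receive roughly 0.175 and 0.075, which together equal the total accuracy gain of 0.25. The exact computation enumerates every coalition and therefore grows exponentially with the number of clients, so a large-scale setting would likely need a sampling-based approximation.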