Secure Feature Selection for Vertical Federated Learning in Ehealth Systems

Rui Zhang,Hongwei Li,Meng Hao,Hanxiao Chen,Yuan Zhang
DOI: https://doi.org/10.1109/icc45855.2022.9838917
2022-01-01
Abstract:Privacy-preserving vertical federated learning (VFL) has been widely applied in electronic health (eHealth) systems. However, existing VFL schemes rarely consider the data pre-processing step including feature selection, which will lead to poor convergence rate and even damaging the model utility. In this paper, we propose an efficient and privacy-preserving feature selection scheme for VFL. Specifically, we first propose a general Gini-impurity based feature selection framework, which is compatible with most existing machine learning models in VFL. With the framework, we present two concrete protocols (dubbed π SS−FS and π H−FS , respectively) customized for different eHealth scenarios. π SS−FS exploits a lightweight additive secret sharing technique, such that it can be executed in comparable time as the evaluation of the plaintext scheme. π H−FS is a hybrid feature selection protocol that additionally utilizes a linear homomorphic encryption technique, to reduce the communication overhead at the cost of a moderate runtime. Moreover, extensive evaluations conducted on real-world medical datasets demonstrate that our scheme realizes up to 27% accuracy gains.
What problem does this paper attempt to address?