A Privacy Knowledge Transfer Method for Clinical Concept Extraction.

Xuan Luo, Yiping Yin, Yice Zhang, Ruifeng Xu
DOI: https://doi.org/10.1007/978-3-030-96033-9_2
2021-01-01
Abstract: Recent work has revealed that deep learning models can leak their training data, which is catastrophic for tasks with strict privacy requirements such as clinical concept extraction. To alleviate privacy leakage during the model release phase, this paper proposes a knowledge-distillation-based privacy protection method. The method follows the teacher-student framework and uses a novel distillation method for sequence labeling, so that the final model is trained without direct contact with the sensitive source data. The paper focuses on the scenario where the training data is multi-source and heterogeneous, and accordingly proposes a re-normalization operation and a weight adjustment strategy for knowledge aggregation. Experimental results on a public dataset show that the proposed privacy protection method achieves performance comparable to the fully supervised method, demonstrating its effectiveness.
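The abstract does not give the formulas for the re-normalization operation or the weight adjustment strategy, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes each teacher is trained on one private source and outputs a per-token distribution over its own label subset; the teacher distributions are combined with per-teacher weights and re-normalized over a unified label space, and the student is trained against the resulting soft label. The function names, the label names, and the weights are all hypothetical.

```python
import math
from typing import Dict, List


def aggregate_soft_labels(
    teacher_probs: List[Dict[str, float]],
    weights: List[float],
    label_space: List[str],
) -> Dict[str, float]:
    """Combine per-token teacher distributions into one soft label.

    Assumption (not from the paper): each teacher only knows a subset of
    the unified label space, so the weighted sum is re-normalized to make
    the result a valid distribution over all labels.
    """
    if len(teacher_probs) != len(weights):
        raise ValueError("need exactly one weight per teacher")

    # Weighted sum of teacher probabilities for every label in the unified space.
    scores = {label: 0.0 for label in label_space}
    for probs, weight in zip(teacher_probs, weights):
        for label, p in probs.items():
            if label in scores:
                scores[label] += weight * p

    total = sum(scores.values())
    if total <= 0.0:
        raise ValueError("teachers assign no mass to the unified label space")

    # Re-normalization: rescale so the aggregated scores sum to 1.
    return {label: score / total for label, score in scores.items()}


def distillation_loss(student_probs: Dict[str, float], target: Dict[str, float]) -> float:
    """Cross-entropy between the aggregated soft label and the student output."""
    eps = 1e-12  # avoids log(0) for labels the student gives zero mass
    return -sum(
        target[label] * math.log(student_probs.get(label, 0.0) + eps)
        for label in target
    )


# Hypothetical example: two teachers from different sources with different label sets.
labels = ["O", "B-Problem", "B-Treatment"]
teacher_a = {"O": 0.2, "B-Problem": 0.8}
teacher_b = {"O": 0.6, "B-Treatment": 0.4}
soft_label = aggregate_soft_labels([teacher_a, teacher_b], [0.5, 0.5], labels)
```

In this sketch the student only ever sees `soft_label`, never the tokens' gold annotations from the private sources, which mirrors the abstract's claim that the final model is trained without direct contact with sensitive data.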