Exploring the tradeoff between data privacy and utility with a clinical data analysis use case

Eunyoung Im, Hyeoneui Kim, Hyungbok Lee, Xiaoqian Jiang, Ju Han Kim
DOI: https://doi.org/10.1186/s12911-024-02545-9
IF: 3.298
2024-06-01
BMC Medical Informatics and Decision Making
Abstract: Securing adequate data privacy is critical for the productive utilization of data. De-identification, which involves masking or replacing specific values in a dataset, can damage the dataset's utility, and finding a reasonable balance between data privacy and utility is not straightforward. Nonetheless, few studies have investigated how data de-identification efforts affect data analysis results. This study aimed to demonstrate the effect of different de-identification methods on a dataset's utility with a clinical analytic use case and to assess the feasibility of finding a workable tradeoff between data privacy and utility.
medical informatics