A Survey on Differential Privacy for Unstructured Data Content
Ying Zhao, Jinjun Chen
DOI: https://doi.org/10.1145/3490237
IF: 16.6
2022-01-06
ACM Computing Surveys
Abstract: Huge amounts of unstructured data, including images, video, audio, and text, are ubiquitously generated and shared, and it is a challenge to protect the sensitive personal information they contain, such as human faces, voiceprints, and authorship. Differential privacy is the standard privacy protection technology that provides rigorous privacy guarantees for various data. This survey summarizes and analyzes differential privacy solutions that protect unstructured data content before it is shared with untrusted parties. These methods obfuscate unstructured data after representing it as vectors, and then reconstruct the data from the obfuscated vectors. We summarize the specific privacy models and mechanisms, together with the possible challenges in them. We also summarize their privacy guarantees against AI attacks and their utility losses. Finally, we discuss several possible directions for future research.
computer science, theory & methods
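
The obfuscate-then-reconstruct pipeline described in the abstract can be illustrated with a minimal sketch. It is not taken from the survey: the Laplace mechanism is chosen here only as one familiar differential privacy mechanism, and the names `laplace_obfuscate`, `encode`, and `decode`, the value range [0, 1], and the flatten/reshape "representation" are all assumptions made for illustration. Real methods would use a learned encoder and decoder.

```python
import numpy as np


def laplace_obfuscate(vec, epsilon, lo=0.0, hi=1.0, rng=None):
    """Add Laplace noise to a vector whose coordinates are clipped to [lo, hi].

    Assumption: the whole vector is one record, so the L1 sensitivity is
    (hi - lo) * d for a d-dimensional vector.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    rng = np.random.default_rng() if rng is None else rng
    v = np.clip(np.asarray(vec, dtype=float), lo, hi)
    sensitivity = (hi - lo) * v.size
    return v + rng.laplace(loc=0.0, scale=sensitivity / epsilon, size=v.shape)


def encode(image):
    # Hypothetical stand-in for a vector representation: flatten the pixels.
    return np.asarray(image, dtype=float).ravel()


def decode(vec, shape):
    # Hypothetical stand-in for reconstruction: clip to the valid range and
    # reshape. This is post-processing, so it does not weaken the guarantee.
    return np.clip(vec, 0.0, 1.0).reshape(shape)


# Toy "image" with pixel values in [0, 1].
image = np.random.default_rng(0).random((4, 4))
noisy = laplace_obfuscate(encode(image), epsilon=5.0, rng=np.random.default_rng(1))
private = decode(noisy, image.shape)
```

A smaller `epsilon` gives a stronger privacy guarantee but a noisier reconstruction, which is the privacy and utility trade-off the abstract refers to.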