Mutual Information Optimally Local Private Discrete Distribution Estimation

Shaowei Wang,Liusheng Huang,Pengzhan Wang,Yiwen Nie,Hongli Xu,Wei Yang,Xiang-Yang Li,Chunming Qiao
DOI: https://doi.org/10.48550/arxiv.1607.08025
2016-01-01
Abstract:Consider statistical learning (e.g. discrete distribution estimation) with local ϵ-differential privacy, which preserves each data provider's privacy locally, we aim to optimize statistical data utility under the privacy constraints. Specifically, we study maximizing mutual information between a provider's data and its private view, and give the exact mutual information bound along with an attainable mechanism: k-subset mechanism as results. The mutual information optimal mechanism randomly outputs a size k subset of the original data domain with delicate probability assignment, where k varies with the privacy level ϵ and the data domain size d. After analysing the limitations of existing local private mechanisms from mutual information perspective, we propose an efficient implementation of the k-subset mechanism for discrete distribution estimation, and show its optimality guarantees over existing approaches.
What problem does this paper attempt to address?