Abstract: Given a set of local sequential datasets held by multiple parties, we study the problem of publishing a synthetic dataset that approximately preserves the sequentiality information of the integrated dataset while satisfying differential privacy for each local dataset. Existing solutions for publishing differentially private sequential data in the centralized setting mostly adopt tree-based approaches, which rely on various tree structures that encode the statistical information of the sequential data. Such a tree is normally constructed by recursively splitting nodes whose noisy scores (e.g., entropy or count) exceed a given threshold. However, extending this idea to the multi-party setting is challenging. First, the comparison between a noisy score and the threshold must be performed in a distributed manner without revealing the noisy score to any party, while still satisfying differential privacy for each local dataset. Second, in the multi-party setting the large number of node-splitting decisions incurs prohibitive computation costs. To address these challenges, we present DPST, a distributed prediction suffix tree construction solution. In DPST, we first introduce a novel node-splitting decision method that computes the comparison result under encryption with substantially improved efficiency. We then present a novel batch-based tree construction approach to reduce computation costs. To achieve high parallel performance without incurring any extra communication cost, we introduce the conjunction and slide methods, which ensure that each batch contains a stable number of carefully arranged decision tasks. To further reduce communication and computation costs, we propose a prefix-based pre-pruning method that reduces the number of nodes whose splitting decision must be made through an interactive protocol. Extensive experiments on real datasets demonstrate that our DPST solution offers desirable data utility with low computation and communication costs.
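
For readers unfamiliar with the centralized tree-based baseline mentioned above, the following is a minimal sketch of threshold-based splitting on noisy counts. It is not the DPST protocol: it runs on a single machine, uses no encryption, and its privacy accounting is deliberately simplified. The function names, the `sensitivity` parameter, and the use of substring occurrence counts as the node score are all assumptions made for illustration.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def count_occurrences(sequences, context):
    # Number of positions at which `context` occurs as a contiguous run.
    k = len(context)
    return sum(
        1
        for s in sequences
        for i in range(len(s) - k + 1)
        if tuple(s[i:i + k]) == context
    )


def build_noisy_tree(sequences, alphabet, threshold, epsilon,
                     max_depth, sensitivity=1.0, seed=0):
    """Grow a suffix-style context tree level by level.

    A node is split (its children are explored) only if its noisy
    count exceeds `threshold`. `sensitivity` is an assumed bound on how
    much one record can change a count; a real mechanism must justify it
    and must also account for the total budget spent across all nodes.
    """
    rng = random.Random(seed)
    tree = {}          # context tuple -> noisy count
    frontier = [()]    # the root (empty context) is always split
    for _ in range(max_depth):
        next_frontier = []
        for ctx in frontier:
            for sym in alphabet:
                child = (sym,) + ctx  # extend the context to the left
                noisy = count_occurrences(sequences, child) + laplace_noise(
                    sensitivity / epsilon, rng
                )
                tree[child] = noisy
                if noisy > threshold:
                    next_frontier.append(child)
        frontier = next_frontier
    return tree
```

In the multi-party setting described in the abstract, the comparison `noisy > threshold` is the step that would have to be evaluated jointly without revealing `noisy` to any party; the sketch performs it in the clear only to show where that decision sits in the construction loop.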
