Towards answering analytical query over hierarchical histogram under untrusted servers

Congcong Fu,Hui Li,Jian Lou,Jiangtao Cui
DOI: https://doi.org/10.1007/s10619-024-07447-3
IF: 0.974
2024-11-16
Distributed and Parallel Databases
Abstract:Hierarchical count histograms involve the publication of count statistics at various granularities based on a predefined hierarchy within a dimension table in a data warehouse. This task finds extensive applications in on-line analytical processing (OLAP) scenarios. This paper focuses on the rigorous privacy-preserving constraint when dealing with an untrusted server. We conduct a systematic investigation of this task and uncover the limitations of the straightforward baseline approach using local differential privacy, as it fails to strike an optimal balance between privacy and utility. We are thus motivated to propose DP-HORUS, a novel crypto-assisted D ifferentially P rivate framework for H ierarchical c O unt histog R ams under U ntrusted S erver. DP-HORUS consists of a series of novel designs, including (1) Encrypted hierarchical tree (EHT) structure, which maintains the concept hierarchy in the input data; (2) Random matrix (RM), which reduces communication and computational cost; (3) To further boosted the utility, we propose DP-HORUS+ encompassing two additional modules of histograms structure (HS) and hierarchical consistency (HC), which are respectively introduced to reduce the noise caused by data sparsity and to ensure the hierarchy consistency; (4) To further boost the robust performance, we propose a series of schemes for workload queries based on DP-HORUS. We provide both theoretical analysis and extensive empirical study on both real-world and synthetic datasets, which demonstrates the superior utility of the proposed methods over the state-of-the-art solutions while ensuring strict privacy guarantee.
computer science, information systems, theory & methods
What problem does this paper attempt to address?