Privacy-Preserving Graph Embedding based on Local Differential Privacy

Zening Li,Rong-Hua Li,Meihao Liao,Fusheng Jin,Guoren Wang
2024-08-04
Abstract:Graph embedding has become a powerful tool for learning latent representations of nodes in a graph. Despite its superior performance in various graph-based machine learning tasks, serious privacy concerns arise when the graph data contains personal or sensitive information. To address this issue, we investigate and develop graph embedding algorithms that satisfy local differential privacy (LDP). We introduce a novel privacy-preserving graph embedding framework, named PrivGE, to protect node data privacy. Specifically, we propose an LDP mechanism to obfuscate node data and utilize personalized PageRank as the proximity measure to learn node representations. Furthermore, we provide a theoretical analysis of the privacy guarantees and utility offered by the PrivGE framework. Extensive experiments on several real-world graph datasets demonstrate that PrivGE achieves an optimal balance between privacy and utility, and significantly outperforms existing methods in node classification and link prediction tasks.
Cryptography and Security,Machine Learning,Social and Information Networks
What problem does this paper attempt to address?
The paper attempts to address the issue of protecting node data privacy during graph embedding, especially when the graph data contains personal or sensitive information. Although graph embedding performs excellently in various graph-based machine learning tasks, it raises serious privacy concerns when the graph data includes sensitive information. To this end, the authors study and develop a graph embedding algorithm that satisfies Local Differential Privacy (LDP), aiming to protect the privacy of node data. Specifically, the paper proposes a novel privacy-preserving graph embedding framework named PrivGE, which introduces an LDP mechanism to obfuscate node data and uses personalized PageRank as a proximity measure to learn node representations. Additionally, the authors provide a theoretical analysis of the privacy protection and utility of the PrivGE framework and validate through experiments on multiple real-world graph datasets that PrivGE achieves the best balance between privacy and utility in node classification and link prediction tasks, significantly outperforming existing methods.