Privacy-Preserving Similarity Search With Efficient Updates in Distributed Key-Value Stores

Wanyu Lin,Helei Cui,Baochun Li,Cong Wang
DOI: https://doi.org/10.1109/tpds.2020.3042695
IF: 5.3
2021-05-01
IEEE Transactions on Parallel and Distributed Systems
Abstract:Privacy-preserving similarity search plays an essential role in data analytics, especially when very large encrypted datasets are stored in the cloud. Existing mechanisms on privacy-preserving similarity search were not able to support secure updates (addition and deletion) efficiently when frequent updates are needed. In this article, we propose a new mechanism to support parallel privacy-preserving similarity search in a distributed key-value store in the cloud, with a focus on efficient addition and deletion operations, both executed with sublinear time complexity. If search accuracy is the top priority, we further leverage Yao's garbled circuits and the homomorphic property of Hash-ElGamal encryption to build a secure evaluation protocol, which can obtain the top-$R$<math>R</math> most accurate results without extensive client-side post-processing. We have formally analyzed the security strength of our proposed approach, and performed an extensive array of experiments to show its superior performance as compared to existing mechanisms in the literature. In particular, we evaluate the performance of our proposed protocol with respect to the time it takes to build the index and perform similarity queries. Extensive experimental results demonstrated that our protocol can speedup the index building process by up to 800× with 2 threads and the similarity queries by up to $sim $<math>∼</math>7× with comparable accuracy, as compared to the state-of-the-art in the literature.
computer science, theory & methods,engineering, electrical & electronic
What problem does this paper attempt to address?