A Decentralized Private Data Marketplace using Blockchain and Secure Multi-Party Computation

Julen Bernabé-Rodríguez, Albert Garreta, Oscar Lage
DOI: https://doi.org/10.1145/3652162
IF: 2.717
2024-03-16
ACM Transactions on Privacy and Security
Abstract: Big data has proven to be a very useful tool for companies and users, but companies with larger datasets have ended up being more competitive than the others thanks to machine learning or artificial intelligence. Secure multi-party computation (SMPC) allows smaller companies to jointly train arbitrary models on their private data while assuring privacy, and thus gives data owners the ability to perform what are currently known as federated learning algorithms. In addition, a blockchain makes it possible to coordinate and audit those computations in a decentralized way. In this document, we consider a private data marketplace as a space where researchers and data owners meet to agree on the use of private data for statistics or more complex model training. This document presents a candidate architecture for a private data marketplace that combines SMPC and a public, general-purpose blockchain. The marketplace is proposed as a smart contract deployed on the blockchain, while the privacy-preserving computation is handled by SMPC.
computer science, information systems
What problem does this paper attempt to address?
The problem this paper attempts to address is that, in the era of big data, small companies struggle to remain competitive against large companies because they hold far less data. To solve this issue, the paper proposes a decentralized private data marketplace that combines blockchain and Secure Multi-Party Computation (SMPC). Specifically, the paper focuses on the following core issues:

1. **Data Privacy Protection**: How to allow multiple data owners to jointly train machine learning or artificial intelligence models without disclosing their data.
2. **Decentralized Collaboration**: How to achieve decentralized coordination and auditing between data owners and researchers through blockchain technology.
3. **Economic Incentive Mechanism**: How to design a market mechanism that encourages data owners to share their private data and receive economic rewards.
4. **Computational Resource Constraints**: How to execute secure multi-party computation effectively when participants have limited computational capabilities, while avoiding the centralization of computing nodes.

The main contributions of the paper include:

- Proposing a decentralized data marketplace architecture that combines SMPC and blockchain technology.
- Designing a grouping protocol and reputation system to reduce errors in SMPC execution.
- Running performance tests of basic statistical and machine learning algorithms on the SMPC protocol.
- Proposing an incentive system design between algorithm owners and data owners.

Through these technologies and mechanisms, the paper aims to create a secure, decentralized, and economically viable data marketplace, enabling small companies to also leverage the advantages of big data.
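To make the idea of computing a statistic over private data more concrete, here is a minimal sketch of additive secret sharing, a common SMPC building block. It is an illustration only: the summary above does not say which SMPC protocol the paper uses, and the names `PRIME`, `share`, `reconstruct`, and `secure_sum` are hypothetical. The sketch assumes honest-but-curious parties and non-negative integer inputs whose total stays below the modulus.

```python
import secrets

# Hypothetical modulus (an assumption, not taken from the paper):
# a Mersenne prime far larger than the toy inputs used below.
PRIME = 2**61 - 1


def share(value, n_parties):
    """Split an integer into n_parties additive shares modulo PRIME."""
    shares = [secrets.randbelow(PRIME) for _ in range(n_parties - 1)]
    # The last share is chosen so that all shares sum to the value mod PRIME.
    last = (value - sum(shares)) % PRIME
    return shares + [last]


def reconstruct(shares):
    """Recombine additive shares into the value they encode."""
    return sum(shares) % PRIME


def secure_sum(private_values):
    """Sum the inputs while each party only ever sees random-looking shares."""
    n = len(private_values)
    # Row i holds the shares produced by data owner i, one per party.
    all_shares = [share(v, n) for v in private_values]
    # Party j locally adds the j-th share it received from every owner.
    partial_sums = [sum(row[j] for row in all_shares) % PRIME for j in range(n)]
    # Only the partial sums are revealed; together they encode the total.
    return reconstruct(partial_sums)


# Toy inputs standing in for three data owners' private values.
private_values = [120, 75, 310]
total = secure_sum(private_values)
mean = total / len(private_values)
```

Any single share is uniformly random on its own, so an individual party learns nothing about another owner's input; only the aggregate is revealed. In a marketplace like the one described, a smart contract could plausibly coordinate who participates and record that the computation took place, but that coordination layer is outside this sketch.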