Peer2PIR: Private Queries for IPFS

Miti Mazmudar,Shannon Veitch,Rasoul Akhavan Mahdavi
2024-11-26
Abstract:The InterPlanetary File System (IPFS) is a peer-to-peer network for storing data in a distributed file system, hosting over 190,000 peers spanning 152 countries. Despite its prominence, the privacy properties that IPFS offers to peers are severely limited. Any query within the network leaks the queried content to other peers. We address IPFS' privacy leakage across three functionalities (peer routing, provider advertisements, and content retrieval), ultimately empowering peers to privately navigate and retrieve content in the network. Our work highlights and addresses novel challenges inherent to integrating PIR into distributed systems. We present our new, private protocols and demonstrate that they incur reasonably low communication and computation overheads. We also provide a systematic comparison of state-of-art PIR protocols in the context of distributed systems.
Cryptography and Security
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the privacy leakage issue in the IPFS (InterPlanetary File System) network. Although IPFS, as a distributed file system, performs excellently in the decentralized network, the user privacy protection provided by its current model is very limited. Specifically, any query in IPFS will disclose the content being queried to other nodes. To meet this challenge, the paper proposes a comprehensive solution, aiming to achieve user privacy protection when querying and retrieving content by improving the three main functions of IPFS (peer routing, provider advertisements, and content retrieval). ### Specific manifestations of the problem: 1. **Peer Routing**: When a node queries the location of another node, the intermediate nodes will know the target node ID. 2. **Provider Advertisements**: When a node queries which node provides a certain file, the server node will know the target CID (Content Identifier). 3. **Content Retrieval**: When a node retrieves a file from another node, the server node will know the target CID and its content. These problems lead to serious leakage of user privacy because any query will let the intermediate nodes or the final nodes know what the user is looking for. ### Solutions in the paper: The paper proposes a new method named Peer2PIR, which uses the Private Information Retrieval (PIR) technology to solve the above - mentioned privacy problems. Specifically: - **Private Peer Routing**: By modifying the routing table and combining with the PIR protocol, the server cannot know the target node ID when responding to the query. - **Private Provider Advertisements**: By storing the provider advertisements in a processed data structure and querying with the PIR protocol, it is ensured that the server cannot know the target CID. - **Private Content Retrieval**: By modifying the content retrieval protocol, it is ensured that the server cannot know the specific CID and its content requested by the client. ### Main contributions: 1. **Comprehensive privacy protection**: This is the first end - to - end privacy protection scheme for IPFS, which not only protects the privacy of the query content but also protects the privacy of the finally retrieved content. 2. **Efficient PIR protocols**: The paper proposes some new PIR protocols, which perform well in terms of communication and computational overhead, and are especially suitable for small databases. 3. **Systematic comparison and optimization**: A systematic comparison of the existing PIR protocols has been carried out, and optimization has been carried out for the DHT environment. Through these improvements, the paper shows that it is feasible to achieve efficient and secure privacy protection in IPFS, and has reasonable latency and resource consumption in practical applications.