Technologies for Search and Data Retrieval in Peer-to-Peer Networks

Jiang Bian
2006-01-01
Abstract:Peer-to-peer (P2P) systems are distributed systems in which nodes of equal roles and capabilities exchange information and services directly with each other. In recent years, P2P has emerged as a popular way to share huge volumes of data. Sharing such large volumes of data is made possible by distributing the main costs disk space for storing the files and bandwidth for transferring them across the peers in the network. In addition, peers form self-organization overlay networks that are overlayed on the Internet Protocol (IP) networks, offering a mix of various features such as robust wide-area routing architecture, efficient search of data items, selection of nearby peers, redundant storage, permanence, trust and authentication, massive scalability and fault tolerance. Because of these desirable qualities, many research projects have been focused on understanding the issues surrounding these systems and improving their performance. The key to the usability of a data-sharing P2P system, and one of the most challenging design aspects, is efficient techniques for search and retrieval of data. The best search techniques for a given system depend on the needs of the application. For storage or archival systems focusing on availability, search techniques such as Chord[17], Pastry[15], Tapestry[23] and CAN[14] are well-suited, because they guarantee location of content if it exists, within a bounded number of hops. To achieve these properties, these techniques tightly control the data placement and topology within the network, and currently only
What problem does this paper attempt to address?