Scalable Routing Algorithm for Partial Matches in P2P Systems

周晋,路海明,卢增祥,李衍达
DOI: https://doi.org/10.3321/j.issn:1000-0054.2004.10.025
2004-01-01
Abstract:The ideal routing algorithm in a P2P system should not only provide the IR algorithms' effectiveness, but also guarantee the router's scalability. This paper describes the distributed hierarchical clustering that meets both objectives. The algorithm is based on a state-of-the-art document ranking algorithm and the vector-space model, through which the files in a P2P network are denoted as vectors. The file content vectors are used to cluster a given content to produce a content clustering tree using a hierarchical clustering algorithm. The vectors are stored in a fully distributed manner. The path length of the routings in the cluster tree will be no longer than the tree depth. Simulations show that, with distributed hierarchical clustering, search recalls are more than eighty percent successful with lookup and join/departure costs both scaling logarithmically with the number of physical nodes.
What problem does this paper attempt to address?