Abstract: In a Peer-to-Peer (P2P) context, a challenging problem is how to find the appropriate peer to handle a given query without consuming excessive bandwidth. Various methods have proposed query routing strategies that take the P2P network at hand into account. This paper considers an unstructured P2P system in which peers are organized around Super-Peers that are connected to a Super-Super-Peer according to their semantic domains. By analyzing the query log file, a predictive model is constructed that avoids flooding the P2P network with queries: it predicts the appropriate Super-Peer, and hence the peer, to answer the query.

A challenging problem in a schema-based P2P system is how to locate the peers that are relevant to a given query. In this paper, an architecture based on (Super-)Peers is proposed, focusing on query routing. The approach groups together (Super-)Peers that have similar interests so that queries can be routed efficiently. In such groups, called Super-Super-Peers (SSP), Super-Peers submit queries that are often processed by members of the same group. An SSP is a specific Super-Peer that holds knowledge about (1) its Super-Peers and (2) the other SSPs. This knowledge is extracted with data mining techniques (e.g., decision tree algorithms) from the queries of peers that transit the network. The advantage of this distributed knowledge is that it avoids performing a semantic mapping between the heterogeneous data sources owned by (Super-)Peers each time the system decides to route a query to other (Super-)Peers. The set of SSPs improves the robustness of the query routing mechanism and the scalability of the P2P network. Compared with a baseline approach, the proposed architecture shows the effect of data mining, with better performance in terms of response time and precision.
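The routing idea described in the abstract, learning from a query log which Super-Peer tends to answer which kind of query, can be illustrated with a small decision tree. The following is a minimal sketch and not the authors' implementation: the log entries, the attribute names (`topic`, `type`), the Super-Peer labels, and the functions `build_tree` and `route` are all hypothetical, and the tree is a plain ID3-style tree over categorical attributes, chosen only because the abstract mentions decision tree algorithms as an example.

```python
import math
from collections import Counter

# Hypothetical query log: the attributes of each past query and the
# Super-Peer that answered it. All values are invented for illustration.
QUERY_LOG = [
    ({"topic": "medicine", "type": "article"}, "SP-health"),
    ({"topic": "medicine", "type": "dataset"}, "SP-health"),
    ({"topic": "physics", "type": "article"}, "SP-science"),
    ({"topic": "physics", "type": "dataset"}, "SP-data"),
    ({"topic": "finance", "type": "dataset"}, "SP-data"),
    ({"topic": "finance", "type": "article"}, "SP-business"),
]


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def build_tree(rows, attributes):
    """Build an ID3-style tree from (features, label) rows."""
    labels = [label for _, label in rows]
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not attributes:
        return majority  # leaf: a Super-Peer identifier

    def remainder(attr):
        # Weighted entropy left after splitting on attr (lower is better).
        groups = {}
        for features, label in rows:
            groups.setdefault(features[attr], []).append(label)
        return sum(len(g) / len(rows) * entropy(g) for g in groups.values())

    best = min(attributes, key=remainder)
    rest = [a for a in attributes if a != best]
    branches = {}
    for features, label in rows:
        branches.setdefault(features[best], []).append((features, label))
    return {
        "attr": best,
        "default": majority,  # fallback for attribute values not seen in the log
        "children": {v: build_tree(sub, rest) for v, sub in branches.items()},
    }


def route(tree, query):
    """Follow the tree until a leaf (a Super-Peer identifier) is reached."""
    while isinstance(tree, dict):
        tree = tree["children"].get(query.get(tree["attr"]), tree["default"])
    return tree


TREE = build_tree(QUERY_LOG, ["topic", "type"])
print(route(TREE, {"topic": "physics", "type": "dataset"}))
```

With such a model, a Super-Super-Peer would forward a query to a single predicted Super-Peer instead of flooding all of them; a query whose attribute values never appeared in the log falls back to the most frequent Super-Peer at that node of the tree.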

Survey on Distributed Data Mining in P2P Networks

Modeling and Performance Analysis of Unstructured P2P Network

Distributed data mining for e-business

Survey of Search and Replication Schemes in Unstructured P2P Networks

Use P2P for Online Transaction and Electronic Marketplace

P2P Simulator for Queries Routing using Data Mining

Secure P2P Topology Based on a Multidimensional DHT Space Mapping

Queries mining for efficient routing in P2P communities

A Problem Oriented Approach to Data Mining in Distributed Spatio-temporal Database

A Study of Algorithms, Systems, and Applications of Multi-Agent Systems for Distributed Data Mining

Data Management in Peer-To-Peer Environment: A Perspective of Bestpeer

A Survey on the Use of P2P Technology for Network Management

P2P Domain Classification using Decision Tree

A Brief Study of Privacy-Preserving Practices (PPP) in Data Mining

A Communication Efficient and Scalable Distributed Data Mining for the Astronomical Data

Query Routing and Processing in Peer-To-Peer Data Sharing Systems

A distributed approach to node clustering in decentralized peer-to-peer networks

Distributed Data Mining Based on Multi-agent System

Grid-based Approaches for Distributed Data Mining Applications

A P2P Computing Approach to Remote Sensing Data Distributed Processing on High-speed LAN

Distributed Knowledge Discovery with Non Linear Dimensionality Reduction