Scalable Semantic Search with Hybrid Concept Index over Structure Peer-to-Peer Network

Wenhui Ma, Gang Wang, Jing Liu
DOI: https://doi.org/10.1109/GCC.2007.110
2007-01-01
Abstract: The primary challenge in developing a peer-to-peer (P2P) file sharing system is implementing an efficient keyword search mechanism. Current keyword search approaches for structured P2P networks are built on a distributed inverted index keyed by keyword. However, when executing multiple-attribute queries, they suffer from unscalable bandwidth consumption. Moreover, these approaches support only literal word matching and do not take the meaning of words into account. In this paper, we propose an efficient keyword search mechanism over structured P2P networks. Peers use a shared ontology to describe the content of a document and the subject of a query. A distributed hybrid concept index is constructed, which efficiently supports query routing and matching and avoids the intersection of inverted lists among peers, which is the cause of unscalable network bandwidth consumption. Based on the semantic similarity between the subjects of queries and the contents of documents, peers obtain results that match their queries semantically rather than by literal word matching. Simulation experiments show that keyword search with the proposed approach incurs much lower bandwidth cost and achieves much higher retrieval performance than search based on a standard keyword inverted index.
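To make the bandwidth problem in the abstract concrete, the following is a minimal sketch of the baseline it criticizes: a keyword inverted index spread over peers, where a multi-keyword query must ship intermediate posting lists from one peer to the next to intersect them. It does not implement the paper's hybrid concept index. The peer count, the hash-based placement, and the names `peer_for`, `build_index`, and `query` are illustrative assumptions, not taken from the paper.

```python
from __future__ import annotations
from hashlib import sha1

NUM_PEERS = 8  # hypothetical network size, chosen only for illustration


def peer_for(keyword: str) -> int:
    """Map a keyword to the peer that stores its posting list (toy DHT placement)."""
    return int(sha1(keyword.encode()).hexdigest(), 16) % NUM_PEERS


def build_index(docs: dict[str, set[str]]) -> dict[int, dict[str, set[str]]]:
    """Distribute an inverted index: each peer holds posting lists for its keywords."""
    peers: dict[int, dict[str, set[str]]] = {p: {} for p in range(NUM_PEERS)}
    for doc_id, words in docs.items():
        for word in words:
            peers[peer_for(word)].setdefault(word, set()).add(doc_id)
    return peers


def query(peers: dict[int, dict[str, set[str]]], keywords: list[str]) -> tuple[set[str], int]:
    """Answer an AND query by forwarding the running result from peer to peer.

    Returns the matching document ids and the number of document ids that
    had to be sent between peers (a rough proxy for bandwidth).
    """
    transferred = 0
    result: set[str] | None = None
    prev_peer = None
    for kw in keywords:
        peer = peer_for(kw)
        postings = peers[peer].get(kw, set())
        if result is None:
            result = set(postings)
        else:
            if peer != prev_peer:
                # The intermediate list travels to the peer holding the next keyword.
                transferred += len(result)
            result &= postings
        prev_peer = peer
    return (result or set()), transferred


docs = {
    "d1": {"p2p", "search"},
    "d2": {"p2p", "index"},
    "d3": {"p2p", "search", "index"},
}
peers = build_index(docs)
matches, cost = query(peers, ["p2p", "search"])
print(sorted(matches), cost)
```

In this toy model the transfer cost grows with the length of the posting lists for popular keywords, which is the scaling problem the abstract attributes to intersecting inverted lists among peers.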