A Distributed Inverted Indexing Scheme for Large-Scale RDF Data.

Xu Li, Xin Wang, Hong Shi, Zhaohua Sheng, Zhiyong Feng
DOI: https://doi.org/10.1007/978-3-642-33050-6_16
2012-01-01
Abstract: With the development of the Linked Data project, an enormous amount of RDF data has been published on the Web. A scalable system is required to provide efficient retrieval over large-scale RDF data. This paper presents a distributed inverted indexing scheme for such data. A scalable inverted index is built on the underlying data structure of Cassandra, a distributed key-value storage system. We optimize the indexing scheme using the characteristics of the RDF data model to support fast keyword search. The loading, encoding, and indexing procedures for RDF data are implemented together using the MapReduce framework. The experimental results show that our indexing scheme can effectively support keyword retrieval for large-scale RDF data. © 2012 Springer-Verlag.
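The abstract does not give the index schema, so the following is only a minimal single-process sketch of the general idea: a map step emits (keyword, posting) pairs from RDF triples, and a reduce step groups them into a term-keyed index. The sample triples, the function names (`tokenize`, `map_phase`, `reduce_phase`, `search`), and the nested layout (term, then subject, then set of predicates, loosely resembling a wide row in a key-value store) are all assumptions made for illustration, not the authors' design.

```python
import re
from collections import defaultdict

# Hypothetical sample triples (subject, predicate, object literal); not from the paper.
TRIPLES = [
    ("ex:TimBL", "foaf:name", "Tim Berners-Lee"),
    ("ex:TimBL", "ex:invented", "World Wide Web"),
    ("ex:W3C", "rdfs:label", "World Wide Web Consortium"),
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric keywords."""
    return re.findall(r"[a-z0-9]+", text.lower())


def map_phase(triples):
    """Map step: emit one (keyword, (subject, predicate)) pair per keyword in each literal."""
    for subject, predicate, obj in triples:
        for term in tokenize(obj):
            yield term, (subject, predicate)


def reduce_phase(pairs):
    """Reduce step: group postings by keyword.

    Assumed layout: term -> {subject -> set of predicates}.
    """
    index = defaultdict(dict)
    for term, (subject, predicate) in pairs:
        index[term].setdefault(subject, set()).add(predicate)
    return index


def search(index, query):
    """Return the subjects whose literals contain every keyword in the query."""
    terms = tokenize(query)
    if not terms:
        return set()
    postings = [set(index.get(term, {})) for term in terms]
    return set.intersection(*postings)


index = reduce_phase(map_phase(TRIPLES))
print(sorted(search(index, "wide web")))
print(sorted(search(index, "consortium")))
```

Keying the outer dictionary by term mirrors how a key-value store would be queried by keyword; in a real deployment the map and reduce steps would run as distributed jobs and the index would be written to the store rather than held in memory.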