SparkRDF: In-Memory Distributed RDF Management Framework for Large-Scale Social Data.

Zhichao Xu,Wei Chen,Lei Gai,Tengjiao Wang
DOI: https://doi.org/10.1007/978-3-319-21042-1_27
2015-01-01
Abstract:Considering the scalability and semantic requirements, Resource Description Framework (RDF) and the de-facto query language SPARQL are well suited for managing and querying online social network (OSN) data. Despite some existing works have introduced distributed framework for querying large-scale data, how to improve online query performance is still a challenging task. To address this problem, this paper proposes a scalable RDF data framework, which uses key-value store for offline RDF storage and pipelined in-memory based query strategy. The proposed framework efficiently supports SPARQL Basic Graph Pattern (BGP) queries on large-scale datasets. Experiments on the benchmark dataset demonstrate that the online SPARQL query performance of our framework outperforms existing distributed RDF solutions.
What problem does this paper attempt to address?