Optimization of RDF data storage and query based on Hadoop

Dezhi Xu, Yang Liu, Ahmed Sarfraz
DOI: https://doi.org/10.3969/j.issn.1001-3695.2017.02.035
2017-01-01
Abstract: With the rapid growth of RDF data, storing and managing it with distributed systems has become a research hotspot. To store and query large-scale RDF data efficiently, this paper discusses several methods based on the Hadoop platform and then proposes a novel method for optimizing RDF data storage and query. The method adopts a hybrid data layout in HBase and an I/O cost model for join queries in MapReduce to optimize RDF data queries. Experiments on LUBM show that the method performs well in both space efficiency and time efficiency for complex queries over RDF data.