A keyword query approach on RDF data

Li Huiying (李慧颖), Qu Yuzhong (瞿裕忠)
DOI: https://doi.org/10.3969/j.issn.1001-0505.2010.02.010
2010-01-01
Abstract: A heuristic search algorithm based on a keyword inverted-list index and a path index is proposed. The algorithm uses a cost-balanced strategy and an equi-distance strategy to find the top-k answers. Resource description framework (RDF) data is modeled as an RDF sentence graph, and all text information is encapsulated in the sentence nodes. An answer to a keyword query is an RDF sentence tree that contains all the keywords and whose leaf nodes are all relevant to the keywords. Finding a shortest answer tree is therefore a Steiner tree problem. If the RDF sentence graph has n nodes, the index space is 3n^2 in the worst case. If there are k relevant nodes, the time complexity is O(kn). The proposed approach supports keywords that match attributes and relations contained in the data, and it does not require information about the RDF data schema. The experimental results show that the approach is feasible and effective.
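The general idea of answering a keyword query over a graph of text-bearing nodes can be illustrated with a minimal sketch. It is not the authors' algorithm: the cost-balanced and equi-distance strategies and the path index are not reproduced here. Instead, the sketch assumes an undirected toy graph, builds a plain keyword inverted index, and ranks candidate root nodes by the sum of their hop distances to the nearest match of each keyword, a simple stand-in for the Steiner tree objective. All names (`build_inverted_index`, `bfs_distances`, `top_k_roots`) and the sample data are hypothetical.

```python
from collections import defaultdict, deque


def build_inverted_index(node_text):
    """Map each lowercase token to the set of nodes whose text contains it."""
    index = defaultdict(set)
    for node, text in node_text.items():
        for token in text.lower().split():
            index[token].add(node)
    return index


def bfs_distances(adjacency, sources):
    """Multi-source BFS: hop distance from the nearest source to each reachable node."""
    dist = {s: 0 for s in sources}
    queue = deque(sources)
    while queue:
        u = queue.popleft()
        for v in adjacency.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def top_k_roots(adjacency, index, keywords, k):
    """Return up to k (cost, root) pairs, where cost is the summed distance
    from the root to the nearest node matching each keyword."""
    per_keyword = []
    for kw in keywords:
        matches = index.get(kw.lower(), set())
        if not matches:
            return []  # a keyword with no match means no answer covers all keywords
        per_keyword.append(bfs_distances(adjacency, sorted(matches)))
    # A candidate root must reach at least one match of every keyword.
    candidates = set(per_keyword[0])
    for dist in per_keyword[1:]:
        candidates &= set(dist)
    scored = [(sum(d[n] for d in per_keyword), n) for n in candidates]
    return sorted(scored)[:k]


# Hypothetical sentence graph: a chain s1 - s2 - s3 - s4 (assumed undirected).
node_text = {
    "s1": "Alice author",
    "s2": "paper keyword search",
    "s3": "RDF data",
    "s4": "conference",
}
adjacency = {
    "s1": ["s2"],
    "s2": ["s1", "s3"],
    "s3": ["s2", "s4"],
    "s4": ["s3"],
}

index = build_inverted_index(node_text)
results = top_k_roots(adjacency, index, ["alice", "rdf"], k=2)
```

Each keyword costs one breadth-first traversal, so the work grows with the number of keywords times the size of the graph. The summed distance only bounds the size of the tree rooted at a candidate from above, so the ranking is a heuristic rather than an exact Steiner tree solution.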