Abstract:When users issue a query to a database, they have expectations about the results. If what they search for is unavailable in the database, the system will return an empty result or, worse, erroneous mismatch results. We call this problem the MisMatch problem. In this paper, we solve the MisMatch problem in the context of XML keyword search. Our solution is based on two novel concepts that we introduce: target node type and Distinguishability. Target Node Type represents the type of node a query result intends to match, and Distinguishability is used to measure the importance of the query keywords. Using these concepts, we develop a low-cost post-processing algorithm on the results of query evaluation to detect the MisMatch problem and generate helpful suggestions to users. Our approach has three noteworthy features: (1) for queries with the MisMatch problem, it generates the explanation, suggested queries and their sample results as the output to users, helping users judge whether the MisMatch problem is solved without reading all query results; (2) it is portable as it can work with any lowest common ancestor-based matching semantics (for XML data without ID references) or minimal Steiner tree-based matching semantics (for XML data with ID references) which return tree structures as results. It is orthogonal to the choice of result retrieval method adopted; (3) it is lightweight in the way that it occupies a very small proportion of the whole query evaluation time. Extensive experiments on three real datasets verify the effectiveness, efficiency and scalability of our approach. A search engine called XClear has been built and is available at http://xclear.comp.nus.edu.sg.

Keywords filtering over probabilistic XML data

A System for Keyword Search on Probability XML Data

Keyword Search on Probabilistic XML Data Based on ELM

Keywords Query of uncertain spatiotemporal data based on XML

Finding and Ranking Compact Connected Trees for Effective Keyword Proximity Search in XML Documents

Data Warehouse Native Feature Based OLAP Querying with Keywords

Keyword Searches in Data-Centric XML Documents Using Tree Partitioning

No Tag, a Little Nesting, and Great XML Keyword Search.

Efficient Keyword Search Over Data-Centric Xml Documents

Effective keyword search in XML documents based on MIU

A General Framework to Resolve the MisMatch Problem in XML Keyword Search

Adaptive And Effective Keyword Search For Xml

Return Specification Inference and Result Clustering for Keyword Search on XML

Keyword Search on XML Data

Foundation Of Keyword Search In Xml

XTree: A New XML Keyword Retrieval Model

Processing Keyword Search on XML: a Survey

Efficient XML Keyword Search: From Graph Model to Tree Model.

Efficient Top-K Algorithm for Extensible Markup Language Keyword Search

An Efficient And Flexible Approach Of Keyword Search In Xml

Keyword Search Based XML Data Source Selection