Abstract:Search result diversification aims to diversify search results to cover different query subtopics, i.e., pieces of relevant information. The state of the art diversification methods often explicitly model the diversity based on query subtopics, and their performance is closely related to the quality of subtopics. Most existing studies extracted query subtopics only from the unstructured data such as document collections. However, there exists a huge amount of information from structured data, which complements the information from the unstructured data. The structured data can provide valuable information about domain knowledge, but is currently under-utilized. In this article, we study how to leverage the integrated information from both structured and unstructured data to extract high quality subtopics for search result diversification. We first discuss how to extract subtopics from structured data. We then propose three methods to integrate structured and unstructured data. Specifically, the first method uses the structured data to guide the subtopic extraction from unstructured data, the second one uses the unstructured data to guide the extraction, and the last one first extracts the subtopics separately from two data sources and then combines those subtopics. Experimental results in both Enterprise and Web search domains show that the proposed methods are effective in extracting high quality subtopics from the integrated information, which can lead to better diversification performance.

Results Diversification for Keyword Search Using Semantic Information of Entity

Results Diversification for XML Keyword Search Based on the Semantic Category of Central Entity

Reserch of Entity Matching Based on Multiple Heterogenous Data

Semantic Relevance Ranking for XML Keyword Search.

Search Result Diversification for Enterprise Data

Knowledge Enhanced Search Result Diversification

Return Specification Inference and Result Clustering for Keyword Search on XML

Multi-dimensional Search Result Diversification

Efficient Keyword Search over Graph-structured XML Documents

A Survey on Search Result Diversification

Semantic-Distance based clustering for XML keyword search

XSeek: A Semantic XML Search Engine Using Keywords.

SPECIAL AREA ORIENTED SEMANTIC SEARCH RESULTS RANKING ALGORITHM

Diversified and Verbalized Result Summarization for Semantic Association Search

Leveraging Integrated Information to Extract Query Subtopics for Search Result Diversification

Diversified Spatial Keyword Search on RDF Data

A Semantics-Based Method For Clustering Of Chinese Web Search Results

Keyword-based Search on Semantic Web Data:The State of the Art

Effective entity unit for XML keyword search

Keymantices :Semantic Keyword Entity Search Mechanism in Dataspaces

A Keyword Based Prototype for Web Search Result Diversification