TreeCluster: clustering results of keyword search over databases

Zhaohui Peng, Jun Zhang, Shan Wang, Lu Qin
DOI: https://doi.org/10.1007/11775300_33
2006-01-01
Abstract: A critical challenge in keyword search over relational databases (KSORD) is to improve result presentation so that users can browse search results quickly. An effective method is to organize the results into clusters. However, traditional clustering methods are not applicable to KSORD search results. In this paper, we propose a novel clustering method named TreeCluster. In the first step, we use labels to represent the schema information of each result tree and reformulate the clustering problem as the problem of judging whether labeled trees are isomorphic. In the second step, we rank user keywords according to their frequencies in the database and further partition the large clusters based on keyword nodes. Furthermore, we give each cluster a readable description, and present the description and each result graphically to help users understand the results more easily. Experimental results verify our method's effectiveness and efficiency.
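
The first step described in the abstract, grouping result trees whose labeled structures are isomorphic, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each result tree is rooted and given as a hypothetical `(label, children)` pair, where the label stands for schema information such as a relation name. The names `canonical` and `cluster_by_shape` are illustrative only, and the second step (partitioning by keyword nodes) is not covered.

```python
from collections import defaultdict


def canonical(tree):
    """Return an order-independent encoding of a rooted labeled tree.

    A tree is a pair (label, [child, ...]). Children are encoded
    recursively and sorted, so two trees receive the same encoding
    exactly when they are isomorphic as rooted labeled trees.
    """
    label, children = tree
    return (label, tuple(sorted(canonical(child) for child in children)))


def cluster_by_shape(result_trees):
    """Group result trees that share the same canonical encoding."""
    clusters = defaultdict(list)
    for tree in result_trees:
        clusters[canonical(tree)].append(tree)
    return list(clusters.values())


# Hypothetical result trees labeled with relation names.
a = ("Write", [("Author", []), ("Paper", [])])
b = ("Write", [("Paper", []), ("Author", [])])  # same shape as a, children reordered
c = ("Cite", [("Paper", []), ("Paper", [])])

clusters = cluster_by_shape([a, b, c])
```

Here `a` and `b` fall into one cluster because they differ only in child order, while `c` forms a cluster of its own. Sorting the child encodings is a standard way to test rooted labeled tree isomorphism; unrooted result trees would first need a canonical root, which this sketch leaves out.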