A Clustering Retrieval System Of Chinese Information

Sin-Guang Sha, Yuan-Chao Liu, Ming Liu, Xiao-Long Wang
DOI: https://doi.org/10.1109/NLPKE.2008.4906815
2009-01-01
Abstract: This paper proposes a novel clustering retrieval system. The system first extracts and ranks salient phrases as candidate cluster themes, using a Support Vector Regression (SVR) model learned from human-labeled training data. Retrieved documents are assigned to the relevant salient phrases to form candidate clusters, and the final clusters are generated by merging these candidate clusters. The paper also looks for a suitable format for displaying the final cluster themes, so that users can easily find the documents they are interested in. Experimental results show that the method is feasible and effective.
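The pipeline outlined in the abstract (extract candidate phrases, rank them, assign documents, merge candidate clusters) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not give the features, the training data, or the merging criterion, so everything concrete below is an assumption: the function names, the two features (document coverage and phrase length), the hand-set linear scorer that stands in for the learned SVR model, and the Jaccard-overlap merge rule with its threshold. Documents are assumed to be already word-segmented and space-separated, which Chinese text would require as a preprocessing step.

```python
from collections import defaultdict


def candidate_phrases(docs, max_len=2):
    """Map each word n-gram (1..max_len words) to the set of doc ids containing it."""
    occurrences = defaultdict(set)
    for doc_id, text in enumerate(docs):
        words = text.lower().split()  # assumes pre-segmented, space-separated text
        for n in range(1, max_len + 1):
            for i in range(len(words) - n + 1):
                occurrences[" ".join(words[i:i + n])].add(doc_id)
    return occurrences


def score(phrase, doc_ids, n_docs, max_len=2):
    """Hypothetical stand-in for the learned SVR model: a fixed linear
    combination of document coverage and normalized phrase length."""
    coverage = len(doc_ids) / n_docs
    length = len(phrase.split()) / max_len
    return 0.7 * coverage + 0.3 * length


def jaccard(a, b):
    """Overlap between two document-id sets."""
    return len(a & b) / len(a | b)


def cluster(docs, top_k=5, merge_threshold=0.5):
    """Return a list of (theme phrases, member doc ids) pairs."""
    occurrences = candidate_phrases(docs)
    # Keep phrases shared by at least two documents, best score first;
    # ties are broken alphabetically so the result is deterministic.
    ranked = sorted(
        ((p, ids) for p, ids in occurrences.items() if len(ids) >= 2),
        key=lambda item: (-score(item[0], item[1], len(docs)), item[0]),
    )[:top_k]

    clusters = []
    for phrase, ids in ranked:
        for themes, members in clusters:
            # Assumed merge rule: fold a candidate cluster into an existing
            # one when their document sets overlap strongly enough.
            if jaccard(members, ids) >= merge_threshold:
                themes.append(phrase)
                members |= ids
                break
        else:
            clusters.append(([phrase], set(ids)))
    return clusters


docs = [
    "support vector regression model",
    "support vector machine training",
    "document clustering search results",
    "clustering search results by phrase",
]
clusters = cluster(docs)
```

In a setup closer to the one described, the `score` function would be replaced by the prediction of a regression model fitted on human-labeled phrase scores; the rest of the sketch would not need to change.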