Refining the Results of Automatic e-Textbook Construction by Clustering

Jing Chen,Qing Li,Ling Feng
DOI: https://doi.org/10.1007/11528043_31
2007-01-01
Abstract: The abundance of knowledge-rich information on the World Wide Web makes compiling an online e-textbook both possible and necessary. The authors of [7] proposed an approach to automatically generate an e-textbook by mining the ranking lists of the search engine. However, the performance of the approach was degraded by Web pages that were relevant but not actually discussing the desired concept. In this paper, we extend the work in [7] by applying a clustering approach before the mining process. The clustering approach serves as a post-processing stage to the original results retrieved by the search engine, and aims to reach an optimum state in which all Web pages assigned to a concept are discussing that exact concept.
What problem does this paper attempt to address?