Extracting a Diverse Information Subset by Considering Information Coverage and Redundancy Simultaneously

Baojun Ma,Qiang Wei,Guoqing Chen,Qiongwei Ye
DOI: https://doi.org/10.1142/9789813273238_0074
2018-01-01
Abstract:Information overload has been a big challenge for web users to find the information they want or are interested in. To extract or provide a small set of diverse result subset is valuable and important to both information providers and users. This paper proposes a heuristic algorithm named CovRedSA-Select by considering information coverage and redundancy simultaneously based on the strategy of simulated annealing. Furthermore, the comparative experiments reveal performances advantageous over other related methods.
What problem does this paper attempt to address?