Abstract:The queries issued to search engines are often ambiguous or multifaceted, which requires search engines to return diverse results that can fulfill as many different information needs as possible; this is called search result diversification. Recently, the relational learning to rank model, which designs a learnable ranking function following the criterion of maximal marginal relevance, has shown effectiveness in search result diversification [Zhu et al. 2014]. The goodness of a diverse ranking model is usually evaluated with diversity evaluation measures such as α-NDCG [Clarke et al. 2008], ERR-IA [Chapelle et al. 2009], and D&num;-NDCG [Sakai and Song 2011]. Ideally the learning algorithm would train a ranking model that could directly optimize the diversity evaluation measures with respect to the training data. Existing relational learning to rank algorithms, however, only train the ranking models by optimizing loss functions that loosely relate to the evaluation measures. To deal with the problem, we propose a general framework for learning relational ranking models via directly optimizing any diversity evaluation measure. In learning, the loss function upper-bounding the basic loss function defined on a diverse ranking measure is minimized. We can derive new diverse ranking algorithms under the framework, and several diverse ranking algorithms are created based on different upper bounds over the basic loss function. We conducted comparisons between the proposed algorithms with conventional diverse ranking methods using the TREC benchmark datasets. Experimental results show that the algorithms derived under the diverse learning to rank framework always significantly outperform the state-of-the-art baselines.

Diversified Search Evaluation: Lessons from the NTCIR-9 INTENT Task

Evaluating diversified search results using per-intent graded relevance.

The Impact Of Intent Selection On Diversified Search Evaluation

Simple Evaluation Metrics for Diversified Search Results.

The Reusability of a Diversified Search Test Collection.

Evaluating Search Result Diversity Using Intent Hierarchies.

Estimating Intent Types for Search Result Diversification.

Summary Of The Ntcir-10 Intent-2 Task: Subtopic Mining And Search Result Diversification

Improve Web Search Diversification with Intent Subtopic Mining

Low-cost, Bottom-Up Measures for Evaluating Search Result Diversification

Summary of the NTCIR-10 INTENT-2 task

Search Result Diversity Evaluation Based on Intent Hierarchies.

A Subtopic Taxonomy-Aware Framework for Diversity Evaluation.

Efficient Diversification of Web Search Results

Revisiting The Evaluation Of Diversified Search Evaluation Metrics With User Preferences

Multi-dimensional Search Result Diversification

Directly Optimize Diversity Evaluation Measures: A New Approach to Search Result Diversification.

Search Result Diversification Based on Hierarchical Intents.

User Session Level Diverse Reranking of Search Results

Microsoft Research Asia at the Web Track of TREC 2009.

Search Result Diversification via Filling Up Multiple Knapsacks.