Is learning to rank effective for Web search?

Min Zhang,Da Kuang,Guichun Hua,Yiqun Liu,Shaoping Ma
2009-01-01
Abstract:LETOR, the benchmark collection for learning to rank, helps make comparative study on different approaches in experimental research. Since the collection is constructed mainly based on TREC datasets, queries and documents in LETOR differ from true Web search scenario on some aspects, such as its incomplete link information, limited documents' domain, and lack of user click information. Hence the observations derived by the collection could be different from that in real Web environment. This paper empirically studies the effectiveness of the state-of-art learning to rank algorithms, especially in Web search scenario. Besides LETOR, a Web search collection is constructed based on the search log of a commercial search engine. Five approaches have been studied, including linear regression, RankBoost, ListNet, top k optimization of ListMLE, and SVM-MAP. Comparative study has been made among algorithms and across different datasets. Furthermore, the effects of learning to rank algorithms are compared with that of content-based and link-based ranking features. Essential differences have been observed and analyzed in the paper in terms of the effectiveness and stability of the algorithms and the feature selection. We believe this study will help the Web search community for better knowledge about whether and to what extent learning to rank algorithms are effective in real applications.
What problem does this paper attempt to address?