Data Preprocessing For Learning To Rank

Tie-Yan Liu
DOI: https://doi.org/10.1007/978-3-642-14267-3_13
2011-01-01
Abstract:This chapter is concerned with data processing for learning to rank. In order to learn an effective ranking model, the first step is to prepare high-quality training data. There are several important issues to be considered regarding the training data. First, it should be considered how to get the data labeled on a large scale but at a low cost. Click-through log mining is one of the feasible approaches for this purpose. Second, since the labeled data are not always correct and effective, selection of the queries and documents, as well as their features should also be considered. In this chapter, we will review several pieces of previous work on these topics, and also make discussions on the future work.
What problem does this paper attempt to address?