The OHSUMED Dataset in LETOR

Jun Xu,Tie-Yan Liu,Hang Li
2007-01-01
Abstract:OHSUMED is one dataset available in the LETOR package. This dataset contains features extracted from query-document pairs in the OHSUMED collection, and the corresponding relevance labels. It also includes the evaluation results of several baseline ranking algorithms using the data. In this document, we first introduce the original OHSUMED collection, and then the features. After that, we describe the training, validation and test sets prepared, as well as the baseline experimental results using the data.
What problem does this paper attempt to address?