TREC 2005 Genomics Track Experiments at DUTAI.

Zhihao Yang,Hongfei Lin,Yanpeng Li,Baoyan Liu,Ye Lu
DOI: https://doi.org/10.6028/nist.sp.500-266.genomics-dalianu.yang
2005-01-01
Abstract:This paper describes the techniques we applied for the two tasks of the TREC Genomics track, i.e., ad hoc retrieval and categorization tasks. For the ad hoc retrieval task, we used query expansion, different scoring strategy on different parts of Medline record (Title, Abstract, RN, MH, etc.) and pseudo relevance feedback. Our submitted run DUTAdHoc2 obtained a MAP of 0.2349. For the categorization task, our system used a SVM classifier with TFIDF term weighting scheme. In addition concept replacing and filtering methods were adopted. Two of our submitted runs (eDUTCat1 and gDUTCat1) produced a Utility score of 0.8496 and 0.572 respectively ranking third and fourth out of 46 runs submitted for the categorization task.
What problem does this paper attempt to address?