THUIR at TREC 2009 Web Track: Finding Relevant and Diverse Results for Large Scale Web Search

F Chen, Tong Zhu, Bin Zhou, Maomao Zhang
2009-01-01
Abstract: This is the 8th year that the IR group of Tsinghua University (THUIR) has participated in TREC. This year we focus on the Web track, which contains two tasks: ad hoc and diversity. For the ad hoc task, we improved the efficiency of our distributed retrieval system TMiner so that it can handle terabytes of Web data. We then carried out three studies: page quality estimation, ranking feature analysis, and model comparison. For the diversity task, we proposed several new approaches on strategy, intention detection, and duplication elimination. To mine users' intentions, we proposed and compared two strategies: + content-based diversity, which is a kind of result clustering, and user-based diverse intention prediction + searching, which belongs to the query expansion branch.