When does Relevance Mean Usefulness and User Satisfaction in Web Search?

Jiaxin Mao,Yiqun Liu,Ke Zhou,Jian-Yun Nie,Jingtao Song,Min Zhang,Shaoping Ma,Jiashen Sun,Hengliang Luo
DOI: https://doi.org/10.1145/2911451.2911507
2016-01-01
Abstract:Relevance is a fundamental concept in information retrieval (IR) studies. It is however often observed that relevance as annotated by secondary assessors may not necessarily mean usefulness and satisfaction perceived by users. In this study, we confirm the difference by a laboratory study in which we collect relevance annotations by external assessors, usefulness and user satisfaction information by users, for a set of search tasks. We also find that a measure based on usefulness rather than relevance annotated has a better correlation with user satisfaction. However, we show that external assessors are capable of annotating usefulness when provided with more search context information. In addition, we also show that it is possible to generate automatically usefulness labels when some training data is available. Our findings explain why traditional system-centric evaluation metrics are not well aligned with user satisfaction and suggest that a usefulness-based evaluation method can be defined to better reflect the quality of search systems perceived by the users.
What problem does this paper attempt to address?