Pairwise, Magnitude, or Stars: What's the Best Way for Crowds to Rate?

Alessandro Checco, Gianluca Demartini
DOI: https://doi.org/10.48550/arXiv.1609.00683
2016-09-03
Abstract: We compare three popular techniques for rating content: the ubiquitous five-star rating, the less-used pairwise comparison, and magnitude estimation, an approach only recently introduced in crowdsourcing. Each technique has specific advantages and disadvantages in terms of required user effort, achievable accuracy in predicting user preferences, and number of ratings required. We design an experiment in which the three techniques are compared in an unbiased way. We collected 39,000 ratings on a popular crowdsourcing platform, allowing us to release a dataset that will be useful for many related studies on user rating techniques.
Information Retrieval, Human-Computer Interaction