Recovering Missing Labels of Crowdsourcing Workers.

Qingyang Hu,Kevin Chiew,Hao Huang,Qinming He
DOI: https://doi.org/10.1137/1.9781611973440.98
2014-01-01
Abstract:Previous chapter Next chapter Full AccessProceedings Proceedings of the 2014 SIAM International Conference on Data Mining (SDM)Recovering Missing Labels of Crowdsourcing WorkersQingyang Hu, Kevin Chiew, Hao Huang, and Qinming HeQingyang Hu, Kevin Chiew, Hao Huang, and Qinming Hepp.857 - 865Chapter DOI:https://doi.org/10.1137/1.9781611973440.98PDFBibTexSections ToolsAdd to favoritesExport CitationTrack CitationsEmail SectionsAboutAbstract Data sets collected from crowdsourcing platforms are well known for their cheap costs. But cheap costs may lead to low quality, i.e., labels may be incorrect or missing. Most of the existing work focuses on modeling the labeling errors of crowd workers, but missing labels can also cause problems when modeling the data. In this paper, we present an algorithm to predict the missing labels of crowd workers, in which we adopt thoughts from semi-supervised learning and utilize the particular consistency between crowd workers. We also define the consistency between workers by crowd labels and develop an algorithm to learn them from the data automatically. Experiments on both benchmark and real data show that our algorithm outperforms traditional semi-supervised learning algorithms in predicting missing labels, and the recovered crowd labels are capable of predicting the ground truth and reflecting real properties of crowd workers. Previous chapter Next chapter RelatedDetails Published:2014eISBN:978-1-61197-344-0 https://doi.org/10.1137/1.9781611973440Book Series Name:ProceedingsBook Code:PRDT14Book Pages:1-1086
What problem does this paper attempt to address?