A Subjectivity-Aware Algorithm for Label Aggregation in Crowdsourcing

Ming Wu, Qianmu Li, Shuo Wang, Jun Hou
DOI: https://doi.org/10.1109/cse/euc.2019.00077
2019-01-01
Abstract: Crowdsourcing has attracted wide attention in machine learning and related fields. Large amounts of labeled data can be obtained quickly and cheaply on crowdsourcing platforms. Because labels collected from crowds are usually noisy, owing to the low accuracy of non-expert online workers, we use quality control methods to improve the quality of crowd data. Unfortunately, current quality control methods consider only instance difficulty or worker reliability to account for the variation among labels given to the same instance; they do not take into account the subjectivity of workers, which also affects the responses. In this paper, we present a novel subjectivity-aware algorithm for label aggregation, which also models the difficulty of instances and the reliability of workers as latent parameters. The method is an EM-like algorithm that infers the ground truth of the instances while simultaneously estimating the latent parameters. Experimental results on real-world datasets show that our method outperforms state-of-the-art ground truth inference algorithms.
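To make the idea of EM-style ground truth inference concrete, the following is a minimal sketch of a generic one-coin, Dawid-Skene-style EM for binary labels. It is not the paper's algorithm: it models only worker reliability and omits both instance difficulty and worker subjectivity, since the abstract does not specify how those are parameterized. The function name `em_aggregate`, the `(instance, worker, label)` input format, and the add-one smoothing are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def em_aggregate(votes, n_iter=20):
    """Illustrative one-coin EM for binary label aggregation.

    votes: list of (instance, worker, label) tuples with label in {0, 1}.
    Returns (labels, reliability): the inferred label per instance and the
    estimated probability that each worker answers correctly.
    NOTE: hypothetical sketch; no difficulty or subjectivity parameters.
    """
    by_instance = defaultdict(list)
    by_worker = defaultdict(list)
    for inst, worker, label in votes:
        by_instance[inst].append((worker, label))
        by_worker[worker].append((inst, label))

    # Initialise P(true label = 1) with the fraction of 1-votes (soft majority vote).
    post = {i: sum(l for _, l in v) / len(v) for i, v in by_instance.items()}
    reliability = {}

    for _ in range(n_iter):
        # M-step: reliability = expected share of a worker's labels that match
        # the current posterior, with add-one smoothing to stay inside (0, 1).
        for w, items in by_worker.items():
            agree = sum(post[i] if l == 1 else 1.0 - post[i] for i, l in items)
            reliability[w] = (agree + 1.0) / (len(items) + 2.0)

        # E-step: posterior from reliability-weighted log-odds, uniform prior.
        for i, items in by_instance.items():
            log_odds = 0.0
            for w, l in items:
                weight = math.log(reliability[w] / (1.0 - reliability[w]))
                log_odds += weight if l == 1 else -weight
            post[i] = 1.0 / (1.0 + math.exp(-log_odds))

    labels = {i: int(p >= 0.5) for i, p in post.items()}
    return labels, reliability
```

Calling `em_aggregate` on a list of votes returns both the aggregated labels and the per-worker reliability estimates, mirroring the abstract's point that the ground truth and the latent parameters are estimated together. A subjectivity-aware model would add further latent parameters to the E- and M-steps.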