An Entropy-based Approach to the Crowd Entity Resolution

Yi Jiang,Wei Zhang,Haiyan Zhao
DOI: https://doi.org/10.1145/2875913.2875936
2015-01-01
Abstract:Crowdsourcing is used to obtain needed ideas and content by soliciting data from a large group of people, especially from an online community. However, the data generated by a group of people is duplicated. As to learn the crowd intention based on the crowd data, we need to do some entity resolution works. Previous works focus on data matching and merging, but remain far from perfect in crowdsourcing area. In our study, we propose a generic way in measuring and representing the crowd intention based on the crowd data. The main contribution of our study is twofold: 1. We propose a graph structure that represents the crowd intention. 2. We propose an entropy-based measurement that evaluates the diversity of the crowd intention.
What problem does this paper attempt to address?