Approximation Algorithms for Aggregate Queries on Uncertain Data

CHEN Donghui,CHEN Ling,WANG Junkai,WU Yong,WANG Jingchang
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2018.26.015
2018-01-01
Abstract:Analyses of big data sets often require aggregate queries on uncertain data with various types of data that are computationally complex.In this paper,the results of aggregate queries on uncertain data are defined to include all possible values and their corresponding probabilities.Dynamic programming is then used to solve the Distribution Sum (DSUM) algorithm using a Greedy-based Distribution Sum and a Binary Merge based Distribution Sum approximation algorithms,which both can be applied to tuple-level and attribute-level uncertainty models.The time and space complexities of the algorithms are determined theoretically as well as the error range of the results.Tests demonstrates that these two approximation algorithms with a 1% allowable error shorten the execution times by 15%-21% and 22% 32%,respectively.
What problem does this paper attempt to address?