Processing K-Skyband, Constrained Skyline, and Group-by Skyline Queries on Incomplete Data

Yunjun Gao,Xiaoye Miao,Huiyong Cui,Gang Chen,Qing Li
DOI: https://doi.org/10.1016/j.eswa.2014.02.033
IF: 8.5
2014-01-01
Expert Systems with Applications
Abstract:The skyline operator has been extensively explored in the literature, and most of the existing approaches assume that all dimensions are available for all data items. However, many practical applications such as sensor networks, decision making, and location-based services, may involve incomplete data items, i.e., some dimensional values are missing, due to the device failure or the privacy preservation. This paper is the first, to our knowledge, study of k-skyband (kSB) query processing on incomplete data, where multi-dimensional data items are missing some values of their dimensions. We formalize the problem, and then present two efficient algorithms for processing it. Our methods introduce some novel concepts including expired skyline, shadow skyline, and thickness warehouse, in order to boost the search performance. As a second step, we extend our techniques to tackle constrained skyline (CS) and group-by skyline (GBS) queries over incomplete data. Extensive experiments with both real and synthetic data sets demonstrate the effectiveness and efficiency of our proposed algorithms under various experimental settings. (C) 2014 Elsevier Ltd. All rights reserved.
What problem does this paper attempt to address?