Abstract:Top-K ranking queries in uncertain databases aim to find the top-K tuples according to a ranking function. The interplay between score and uncertainty makes top-K ranking in uncertain databases an intriguing issue, leading to rich query semantics. Recently, a unified ranking framework based on parameterized ranking functions (PRFs) has been formulated, which generalizes many previously proposed ranking semantics. Under the PRFs based ranking framework, efficient pruning approach for Top-K ranking on datasets with tuple-wise uncertainty has been well studied in the literature. However, this cannot be applied to top-K ranking on datasets with attribute-wise uncertainty, which are often natural and useful in analyzing uncertain data in many applications. This paper aims to develop efficient pruning techniques for top-K ranking on datasets with attribute-wise uncertainty under the PRFs based ranking framework, which has not been well studied in the literature. We first develop a Tuple Insertion Based Algorithm for computing each tuple’s PRF value, which reduce the time cost from the state of the art cubic order of magnitude to quadratic order of magnitude. Based on the Tuple Insertion Based Algorithm, three pruning strategies are developed to further reduce the time cost. The mathematics of deriving the Tuple Insertion Based Algorithm and corresponding pruning strategies are also presented. At last, we show that our pruning algorithms can also be applied to the computation of the top-k aggregate queries. The experimental results on both real and synthetic data demonstrate the effectiveness and efficiency of the proposed pruning techniques.

Top-K Aggregate Queries on Continuous Probabilistic Datasets

Ranking Continuous Probabilistic Datasets.

Efficient Pruning for Top-K Ranking Queries on Attribute-Wise Uncertain Datasets

Semantics and Evaluation of Top-k Queries in Probabilistic Databases

Fast Probabilistic Ranking under x-Relation Model

Efficient Pruning Algorithm for Top-K Ranking on Dataset with Value Uncertainty

Probabilistic Top-k Dominating Query Monitoring over Multiple Uncertain IoT Data Streams in Edge Computing Environments

Sensitivity Analysis of Answer Ordering from Probabilistic Databases.

Probabilistic Top-k Dominating Query over Sliding Windows.

Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams

Aggregation-Aware Top-k Computation for Full-Text Search

Top-k Dominating Queries on Incomplete Data

Aggregate Queries on Constrained Probabilistic Similarity Join Pairs

Handling Er-Topk Query On Uncertain Streams

A Unified Approach to Ranking in Probabilistic Databases

Processing Spatial Keyword Query As a Top-K Aggregation Query

Continuous Temporal Top-k Query over Versioned Documents.

Querying Uncertain Data with Aggregate Constraints.

Handling ER-top

Sliding window top-k dominating query processing over distributed data streams

Crowdsourced Top- K Queries by Pairwise Preference Judgments with Confidence and Budget Control