Efficient Top-k Dominating Computation on Massive Data

Xixian Han,Jianzhong Li,Hong Gao
DOI: https://doi.org/10.1109/tkde.2017.2665619
IF: 9.235
2017-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:In many applications, top-k dominating query is an important operation to return k tuples with the highest domination scores in a potentially huge data space. It is analyzed that the existing algorithms have their performance problems when performed on massive data. This paper proposes a novel table-scan-based TDTS algorithm to efficiently compute top-k dominating results. TDTS first presorts the table for early termination. The early termination checking is proposed in this paper, along with the theoretical analysis of scan depth. The pruning operation for tuples is devised in this paper. The theoretical pruning effect shows that the number of tuples maintained in TDTS can be reduced substantially. The extensive experimental results, conducted on synthetic and real-life data sets, show that TDTS outperforms the existing algorithms significantly.
What problem does this paper attempt to address?