TopN Sort Algorithm Based on Multiple Filtering

ZHANG Jun,YANG Jiahai,WANG Jilong
DOI: https://doi.org/10.3321/j.issn:1000-0054.2006.04.040
2006-01-01
Abstract:A TopN sort algorithm based on multiple filtering was developed to improve the performance of the conventional TopN sort algorithm.The algorithm first constructs a sampling set of(k×N) elements by randomly sampling the original dataset.The algorithm then finds the N~(th) element(in decreasing order) from the sampling set,which is used to filter out elements of the original dataset that are smaller than this element.This process is repeated till the number of elements in the original dataset is less than k×N.The algorithm then sorts the remaining elements in the original dataset using quicksort and outputs the first N elements.A theoretical analysis and sample comparisons show that the temporal performance of this TopN algorithm is about 50% better than conventional TopN algorithms,such as heapsort.
What problem does this paper attempt to address?