Abstract: With the rapid development of computing technology, using parallel computing to solve large-scale ranking-and-selection (R&S) problems has emerged as an important research topic. However, directly implementing traditional fully sequential procedures in parallel computing environments may encounter various problems. First, the scheme of all-pairwise comparisons, which is commonly used in fully sequential procedures, requires a large amount of computation and significantly slows down the selection process. Second, traditional fully sequential procedures require frequent communication and coordination among processors, which is also inefficient in parallel computing environments. In this paper, we propose three modifications to one classical fully sequential procedure, Paulson's procedure, to speed up its selection process in parallel computing environments. First, we show that if no common random numbers are used, then we can significantly reduce the computation spent on all-pairwise comparisons at each round. Second, we show that batching different alternatives reduces the communication cost among the processors, which improves the procedure's performance. Third, to boost the procedure's final-stage selection, when the number of surviving alternatives is less than the number of processors, we suggest sampling all surviving alternatives up to the maximal number of observations that they should take. We show that, after these modifications, the procedure remains statistically valid and is more efficient than existing parallel procedures in the literature.

Summary of Contribution: Ranking and selection (R&S) is a branch of simulation optimization, which is an important area of operations research. In recent years, using parallel computing to solve large-scale R&S problems has emerged as an important research topic, and this topic is naturally situated at the intersection of computing and operations research. In this paper, we consider how to improve a fully sequential R&S procedure, namely, Paulson's procedure, to reduce the high computational complexity of all-pairwise comparisons and the burden of frequent communication and coordination. The goal is to make the procedure more suitable and more efficient for solving large-scale R&S problems in parallel computing environments, which are becoming ubiquitous and accessible to ordinary users. The procedure designed in this paper appears more efficient than the ones available in the literature and is capable of solving R&S problems with over a million alternatives in a parallel computing environment with 96 processors. The paper also extends the theory of R&S by showing that the all-pairwise comparisons may be decomposed so that the computational complexity is reduced significantly, which drastically improves the efficiency of all-pairwise comparisons as observed in numerical experiments.
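To give a rough sense of why all-pairwise comparisons can sometimes be decomposed, the sketch below uses a deliberately simplified setting that is an assumption of this example, not the paper's procedure: every surviving alternative has the same number of observations, and a single common threshold governs elimination. Under those assumptions, "some alternative leads alternative i by more than the threshold" holds exactly when the largest running sum leads i by more than the threshold, so one pass over the alternatives replaces the quadratic scan. The function names, the threshold value, and the simulated sums are all hypothetical.

```python
import random


def eliminate_all_pairs(sums, threshold):
    """Quadratic scan: keep i unless some j leads it by more than threshold."""
    return [
        i
        for i, s_i in enumerate(sums)
        if not any(s_j - s_i > threshold for s_j in sums)
    ]


def eliminate_via_max(sums, threshold):
    """Linear scan: compare every alternative with the current maximum only."""
    best = max(sums)
    return [i for i, s_i in enumerate(sums) if best - s_i <= threshold]


# Hypothetical running sums for 200 alternatives (illustrative data only).
random.seed(0)
sums = [random.gauss(0.0, 5.0) for _ in range(200)]

survivors_pairwise = eliminate_all_pairs(sums, threshold=3.0)
survivors_fast = eliminate_via_max(sums, threshold=3.0)
```

Both functions return the same set of surviving indices; the second touches each alternative once per round instead of once per pair. The conditions under which the paper obtains its reduction (no common random numbers, Paulson's elimination rule) are more general than this toy setting.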

Efficient processing of top-k queries: selective NRA algorithms

Selective-NRA Algorithms for Top-k Queries.

Theory and Application of Total Project Management

Performance Optimization of Top-k Queries on Multicore Platform

Performance Optimization of Top-k Queries on GPU.

Efficient Parallel Processing of High-Dimensional Spatial K NN Queries

Supporting Efficient Top-K Queries in Type-Ahead Search

Tight Data Access Bounds for Private Top-$k$ Selection

RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks

Processing Long Queries Against Short Text

Efficient Top-K Query Processing Algorithms in Highly Distributed Environments

Optimizing top-k retrieval: submodularity analysis and search strategies

Solving Large-Scale Fixed-Budget Ranking and Selection Problems

Optimal Computing Budget Allocation for Data-driven Ranking and Selection

Non-Myopic Knowledge Gradient Policy for Ranking and Selection.

Top-k learning to rank: labeling, ranking and evaluation.

More Bang For Your Buck(et): Fast and Space-efficient Hardware-accelerated Coarse-granular Indexing on GPUs

Efficient Pruning for Top-K Ranking Queries on Attribute-Wise Uncertain Datasets

Reverse top-k group nearest neighbor search

Speeding Up Paulson's Procedure for Large-Scale Problems Using Parallel Computing

Parallel Algorithms for Select and Partition with Noisy Comparisons