Technical Note-Knowledge Gradient for Selection with Covariates: Consistency and Computation

Liang Ding,L. Jeff Hong,Haihui Shen,Xiaowei Zhang
DOI: https://doi.org/10.1002/nav.22028
2021-01-01
Naval Research Logistics (NRL)
Abstract:Knowledge gradient is a design principle for developing Bayesian sequentialsampling policies to solve optimization problems. In this paper we consider theranking and selection problem in the presence of covariates, where the bestalternative is not universal but depends on the covariates. In this context, weprove that under minimal assumptions, the sampling policy based on knowledgegradient is consistent, in the sense that following the policy the bestalternative as a function of the covariates will be identified almost surely asthe number of samples grows. We also propose a stochastic gradient ascentalgorithm for computing the sampling policy and demonstrate its performance vianumerical experiments.
What problem does this paper attempt to address?