Independence and conditional independence in discrete-time dynamic allocation problems

Christopher Wang
2023-12-15
Abstract:The dynamic allocation problem simulates a situation in which one is faced with a tradeoff between actions that yield an immediate reward and actions whose benefits can only be perceived in the future. In this paper, we discuss solutions to the non-Markovian, discrete-time problem under the assumption that the rewards processes are either independent or conditionally independent. In the conditionally independent setting, we prove that the maximal attainable value of the dynamic allocation problem can be represented as the maximal attainable value of a different but related dynamic allocation problem in which all rewards processes are almost surely decreasing, and that there exists a strategy which is optimal for both problems. We also discuss strategies of the so-called ``index type,'' and their relationship with the ``synchronization'' paradigm from operations research.
Probability,Optimization and Control
What problem does this paper attempt to address?