Abstract: The problem of high-dimensional path-dependent optimal stopping (OS) is important to multiple academic communities and applications. Modern OS tasks often have a large number of decision epochs and complicated non-Markovian dynamics, making them especially challenging. Standard approaches, often relying on approximate dynamic programming (ADP), duality, deep learning and other heuristics, have shown strong empirical performance, yet come with limited rigorous guarantees (the available bounds may scale exponentially in the problem parameters and/or require prior knowledge of basis functions or additional continuity assumptions). Although past work has placed these problems in the framework of computational complexity and polynomial-time approximability, those analyses were limited to simple one-dimensional problems. For long-horizon complex OS problems, is a polynomial-time solution even theoretically possible? We prove that, given access to an efficient simulator of the underlying information process and a fixed accuracy epsilon, there exists an algorithm that returns an epsilon-optimal solution (both a stopping policy and an approximate optimal value) with computational complexity scaling polynomially in the time horizon and the underlying dimension. Like the first polynomial-time (approximation) algorithms for several other well-studied problems, our theoretical guarantees are polynomial yet impractical. Our approach is based on a novel expansion for the optimal value, which may be of independent interest.
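
To make the main claim concrete, the guarantee can be sketched in symbols. The notation is an assumption for illustration and does not appear in the abstract: $Y_1,\dots,Y_T$ denotes the $D$-dimensional information process over horizon $T$, $g_t$ is a hypothetical reward function of the path observed so far (taken here to lie in $[0,1]$), and $\mathcal{T}$ is the set of stopping times adapted to the process. The error is written additively, which is one common convention; the paper may normalize differently. Because the method relies on simulation, such a guarantee would typically hold with high probability rather than surely.

```latex
% Illustrative notation only: g_t, Y_t, \mathcal{T} are assumed, not taken from the abstract.
\[
  \mathrm{OPT} \;=\; \sup_{\tau \in \mathcal{T}}
  \mathbb{E}\bigl[\, g_\tau(Y_1,\dots,Y_\tau) \,\bigr],
  \qquad \tau \in \{1,\dots,T\}.
\]
An $\epsilon$-optimal solution consists of a stopping time
$\hat{\tau} \in \mathcal{T}$ and an estimate $\widehat{\mathrm{OPT}}$ with
\[
  \mathbb{E}\bigl[\, g_{\hat{\tau}}(Y_1,\dots,Y_{\hat{\tau}}) \,\bigr]
  \;\ge\; \mathrm{OPT} - \epsilon,
  \qquad
  \bigl|\widehat{\mathrm{OPT}} - \mathrm{OPT}\bigr| \;\le\; \epsilon,
\]
computed, for fixed $\epsilon$, in time polynomial in $T$ and $D$
(counting each call to the simulator as an efficient operation).
```

The path dependence enters through $g_\tau$ depending on the whole history $Y_1,\dots,Y_\tau$ rather than on $Y_\tau$ alone, which is why methods that assume Markovian dynamics do not apply directly.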

The Duration of Optimal Stopping Problems

Asymptotic Duration for Optimal Multiple Stopping Problems

The Optimal Stopping Problem under a Random Horizon

Optimal stopping problem under random horizon

Optimal Multistage Sampling in a Boundary-Crossing Problem

Comparative Statics for Optimal Stopping Problems in Nonstationary Environments

Randomized Optimal Stopping Problem in Continuous Time and Reinforcement Learning Algorithm

On the rates of convergence of simulation based optimization algorithms for optimal stopping problems

Data-driven optimal stopping: A pure exploration analysis

Optimal Stopping for Dynamic Recruitment Problem with Probabilistic Loss of Candidates

Exploratory Optimal Stopping: A Singular Control Formulation

Sequential Design for Optimal Stopping Problems

The last-success stopping problem with random observation times

Polynomial time algorithm for optimal stopping with fixed accuracy

Infinite horizon stopping problems with (nearly) total reward criteria

Optimal stopping with behaviorally biased agents: The role of loss aversion and changing reference points

Stochastic Processes with Expected Stopping Time

Time-inconsistent mean-field optimal stopping: A limit approach

Efficiency of a Stochastic Search with Punctual and Costly Restarts

Risk-Sensitive Stopping Problems for Continuous-Time Markov Chains