Learning to Optimally Stop a Diffusion Process

Min Dai, Yu Sun, Zuo Quan Xu, Xun Yu Zhou
2024-08-18
Abstract: We study optimal stopping for a diffusion process with unknown model primitives within the continuous-time reinforcement learning (RL) framework developed by Wang et al. (2020). By penalizing its variational inequality, we transform the stopping problem into a stochastic optimal control problem with two actions. We then randomize control into Bernoulli distributions and add an entropy regularizer to encourage exploration. We derive a semi-analytical optimal Bernoulli distribution, based on which we devise RL algorithms using the martingale approach established in Jia and Zhou (2022a) and prove a policy improvement theorem. Finally, we demonstrate the effectiveness of the algorithms in examples of pricing finite-horizon American put options and solving Merton's problem with transaction costs, and show that both the offline and online algorithms achieve high accuracy in learning the value functions and characterizing the associated free boundaries.
Optimization and Control, Mathematical Finance, Pricing of Securities
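
The abstract's idea of randomizing a two-action control into a Bernoulli distribution with an entropy regularizer can be illustrated on a generic one-step problem. The sketch below is not the paper's semi-analytical formula, which arises from the penalized variational inequality; it only shows the standard fact that maximizing p*a + (1-p)*b + lambda*H(p) over p in [0, 1], with H the Shannon entropy, gives a logistic probability. The names `bernoulli_stop_probability`, `regularized_objective`, `stop_value`, `continue_value`, and `temperature` are hypothetical and chosen for illustration.

```python
import math


def bernoulli_stop_probability(stop_value, continue_value, temperature):
    """Probability of the 'stop' action that maximizes
    p*stop_value + (1-p)*continue_value + temperature*H(p),
    where H(p) = -p*log(p) - (1-p)*log(1-p).

    Setting the derivative to zero gives log(p/(1-p)) = (a-b)/temperature,
    i.e. a logistic function of the value gap. Illustrative only.
    """
    z = (stop_value - continue_value) / temperature
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def regularized_objective(p, stop_value, continue_value, temperature):
    """Entropy-regularized expected payoff of a Bernoulli(p) action."""
    entropy = 0.0
    for q in (p, 1.0 - p):
        if q > 0.0:
            entropy -= q * math.log(q)
    return p * stop_value + (1.0 - p) * continue_value + temperature * entropy


if __name__ == "__main__":
    # Hypothetical values: stopping pays 1.0, continuing is worth 0.8.
    for temperature in (1.0, 0.1, 0.01):
        p = bernoulli_stop_probability(1.0, 0.8, temperature)
        print(f"temperature={temperature}: P(stop)={p:.4f}")
```

As the temperature shrinks, the probability concentrates on the action with the larger value, so exploration vanishes and the deterministic stopping rule is recovered in the limit; a larger temperature pushes the distribution toward a fair coin.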