On Parametric Optimal Execution and Machine Learning Surrogates

Tao Chen,Mike Ludkovski,Moritz Voß
2023-10-29
Abstract:We investigate optimal order execution problems in discrete time with instantaneous price impact and stochastic resilience. First, in the setting of linear transient price impact we derive a closed-form recursion for the optimal strategy, extending the deterministic results from Obizhaeva and Wang (J Financial Markets, 2013). Second, we develop a numerical algorithm based on dynamic programming and deep learning for the case of nonlinear transient price impact as proposed by Bouchaud et al. (Quant. Finance, 2004). Specifically, we utilize an actor-critic framework that constructs two neural-network (NN) surrogates for the value function and the feedback control. The flexible scalability of NN functional approximators enables parametric learning, i.e., incorporating several model or market parameters as part of the input space. Precise calibration of price impact, resilience, etc., is known to be extremely challenging and hence it is critical to understand sensitivity of the execution policy to these parameters. Our NN learner organically scales across multiple input dimensions and is shown to accurately approximate optimal strategies across a wide range of parameter configurations. We provide a fully reproducible Jupyter Notebook with our NN implementation, which is of independent pedagogical interest, demonstrating the ease of use of NN surrogates in (parametric) stochastic control problems.
Trading and Market Microstructure,Machine Learning,Computational Finance
What problem does this paper attempt to address?
The paper mainly addresses two aspects: 1. **Derivation of closed-form recursive formulas under linear instantaneous price impact**: Firstly, under the setting of linear instantaneous price impact, the authors derive a closed-form recursive formula for the optimal strategy, extending the results of Obizhaeva and Wang in the deterministic case. 2. **Development of numerical algorithms under nonlinear instantaneous price impact**: Secondly, for the case of nonlinear instantaneous price impact, the authors propose a numerical algorithm based on dynamic programming and deep learning. Specifically, they utilize the actor-critic framework to construct neural network agents for the value function and feedback control. This method can handle multiple input dimensions (i.e., different market parameters) and has been shown to accurately approximate the optimal strategy. Overall, this paper aims to address the problem of optimal execution strategies in financial trading, particularly in the presence of instantaneous price impact and market resilience. By introducing machine learning techniques (especially neural networks), the authors propose new methods to tackle the challenges posed by model parameter uncertainty and provide a flexible and scalable approach to handle these complex issues. Additionally, the paper provides a fully reproducible Jupyter Notebook that demonstrates how to use neural network agents to solve such stochastic control problems, which is very useful for both teaching and research.