A Machine Learning Algorithm for Finite-Horizon Stochastic Control Problems in Economics

Xianhua Peng,Steven Kou,Lekang Zhang
2024-11-13
Abstract:We propose a machine learning algorithm for solving finite-horizon stochastic control problems based on a deep neural network representation of the optimal policy functions. The algorithm has three features: (1) It can solve high-dimensional (e.g., over 100 dimensions) and finite-horizon time-inhomogeneous stochastic control problems. (2) It has a monotonicity of performance improvement in each iteration, leading to good convergence properties. (3) It does not rely on the Bellman equation. To demonstrate the efficiency of the algorithm, it is applied to solve various finite-horizon time-inhomogeneous problems including recursive utility optimization under a stochastic volatility model, a multi-sector stochastic growth, and optimal control under a dynamic stochastic integration of climate and economy model with eight-dimensional state vectors and 600 time periods.
General Economics,Optimization and Control,Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to solve the stochastic control problem within a finite time - horizon in economics. Specifically, the author proposes a machine - learning algorithm based on the deep neural network to represent the optimal policy function, namely the Monotonic Monte - Carlo Control (MMCC) algorithm, in order to address the following challenges: 1. **High - dimensional and finite time - horizon**: Many stochastic control problems in economics are of a finite time - horizon and time - varying (time - inhomogeneous), which are more difficult to solve than the infinite time - horizon problems because the optimal control strategies in different time periods are different. 2. **The curse of dimensionality**: Due to high - dimensionality (for example, more than 100 dimensions) and complex stochastic dynamics, it is usually very difficult to numerically solve high - dimensional stochastic control problems. 3. **Limitations of the Bellman equation**: If the utility function in the control problem is not time - separable, then the problem may not have a Bellman equation, and thus the traditional dynamic programming method cannot be used. ### Solutions proposed in the paper To overcome the above difficulties, the author proposes the MMCC algorithm, whose main features include: - **Not relying on the Bellman equation**: The MMCC algorithm does not need to use the Bellman equation or the Euler equation, so it can handle a wider range of stochastic control problems. - **Monotonically improving performance**: In each iteration, the algorithm monotonically improves performance, thus having good convergence properties. - **Applicable to complex stochastic dynamics**: The algorithm allows the state evolution to have general stochastic dynamics, not just limited to discretized diffusion processes or Lévy processes. ### Application examples The paper demonstrates the effectiveness of the MMCC algorithm in multiple high - dimensional (such as more than 100 dimensions) stochastic control problems, including: - **Recursive utility optimization under the stochastic volatility model** - **Multi - sector stochastic growth problems** - **A stochastic integrated model of climate and economic dynamics**, where the state vector is 8 - dimensional and the time period is 600. Through these applications, the paper verifies the high efficiency and applicability of the MMCC algorithm in solving complex economic problems. ### Summary This paper aims to provide a new and efficient machine - learning algorithm to solve high - dimensional, finite - time - horizon, and time - inhomogeneous stochastic control problems in economics, especially those problems that are not suitable for being solved by traditional dynamic programming methods.