Abstract:We are concerned with the linear-quadratic optimal stochastic control problem with random coefficients. Under suitable conditions, we prove that the value field $V(t,x,\omega), (t,x,\omega)\in [0,T]\times R^n\times \Omega$, is quadratic in $x$, and has the following form: $V(t,x)=\langle K_tx, x\rangle$ where $K$ is an essentially bounded nonnegative symmetric matrix-valued adapted processes. Using the dynamic programming principle (DPP), we prove that $K$ is a continuous semi-martingale of the form $$K_t=K_0+\int_0^t \, dk_s+\sum_{i=1}^d\int_0^tL_s^i\, dW_s^i, \quad t\in [0,T]$$ with $k$ being a continuous process of bounded variation and $$E\left[\left(\int_0^T|L_s|^2\, ds\right)^p\right] <\infty, \quad \forall p\ge 2; $$ and that $(K, L)$ with $L:=(L^1, \cdots, L^d)$ is a solution to the associated backward stochastic Riccati equation (BSRE), whose generator is highly nonlinear in the unknown pair of processes. The uniqueness is also proved via a localized completion of squares in a self-contained manner for a general BSRE. The existence and uniqueness of adapted solution to a general BSRE was initially proposed by the French mathematician J. M. Bismut (1976, 1978). It had been solved by the author (2003) via the stochastic maximum principle with a viewpoint of stochastic flow for the associated stochastic Hamiltonian system. The present paper is its companion, and gives the {\it second but more comprehensive} adapted solution to a general BSRE via the DDP. Further extensions to the jump-diffusion control system and to the general nonlinear control system are possible.

Convergence of stochastic iterative dynamic programming algorithms

Stochastic iterative dynamic programming: a Monte Carlo approach to dual control

Approximate dynamic programming for continuous state and control problems

Revisiting approximate dynamic programming and its convergence

Improved Dynamic Programming Methods for Optimal Control of Lumped-Parameter Stochastic Systems

A General Framework for Analyzing Stochastic Dynamics in Learning Algorithms

Convergence of Direct Heuristic Dynamic Programming in Power System Stability Control.

Complexity of stochastic dual dynamic programming

Dynamic Programming for General Linear Quadratic Optimal Stochastic Control with Random Coefficients

Convergence of Dynamic Programming Models

Variational Dynamic Programming for Stochastic Optimal Control

A kernel-based approximate dynamic programming approach: Theory and application

An Approximate Dynamic Programming Approach for Dual Stochastic Model Predictive Control

Stochastic Primal-Dual Algorithms with Faster Convergence Than O(1/√T) for Problems Without Bilinear Structure

Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications

Information Relaxation and A Duality-Driven Algorithm for Stochastic Dynamic Programs

The landscape of deterministic and stochastic optimal control problems: One-shot Optimization versus Dynamic Programming

Optimizing Trajectories with Closed-Loop Dynamic SQP

Differential dynamic programming with stagewise equality and inequality constraints using interior point method

Stochastic Primal-Dual Algorithms with Faster Convergence than $O(1/\sqrt{T})$ for Problems without Bilinear Structure

Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems