Abstract:This paper concerns quasi-stochastic approximation (QSA) to solve root finding problems commonly found in applications to optimization and reinforcement learning. The general constant gain algorithm may be expressed as the time-inhomogeneous ODE $ \frac{d}{dt}\Theta_t=\alpha f_t (\Theta_t)$, with state process $\Theta$ evolving on $\mathbb{R}^d$. Theory is based on an almost periodic vector field, so that in particular the time average of $f_t(\theta)$ defines the time-homogeneous mean vector field $\bar{f} \colon \mathbb{R}^d \to \mathbb{R}^d$ with $\bar{f}(\theta^*)=0$. Under smoothness assumptions on the functions involved, the following exact representation is obtained: \[\frac{d}{dt}\Theta_t=\alpha[\bar{f}(\Theta_t)-\alpha\bar\Upsilon_t+\alpha^2\mathcal{W}_t^0+\alpha\frac{d}{dt}\mathcal{W}_t^1+\frac{d^2}{dt^2}\mathcal{W}_t^2]\] along with formulae for the smooth signals $\{\bar \Upsilon_t , \mathcal{W}_t^i : i=0, 1, 2\}$. This new representation, combined with new conditions for ultimate boundedness, has many applications for furthering the theory of QSA and its applications, including the following implications that are developed in this paper: (i) A proof that the estimation error $\|\Theta_t-\theta^*\|$ is of order $O(\alpha)$, but can be reduced to $O(\alpha^2)$ using a second order linear filter. (ii) In application to extremum seeking control, it is found that the results do not apply because the standard algorithms are not Lipschitz continuous. A new approach is presented to ensure that the required Lipschitz bounds hold, and from this we obtain stability, transient bounds, and asymptotic bias of order $O(\alpha^2)$, and asymptotic variance of order $O(\alpha^4)$. (iii) It is in general possible to obtain better than $O(\alpha)$ bounds on error in traditional stochastic approximation when there is Markovian noise.

Markovian Foundations for Quasi-Stochastic Approximation with Applications to Extremum Seeking Control

Markovian Foundations for Quasi-Stochastic Approximation in Two Timescales: Extended Version

Time-delayed Feedback Control Optimization for Quasi Linear Systems under Random Excitations

Stochastic Optimal Control of Quasi Non-Integrable Hamiltonian Systems with Stochastic Maximum Principle

The Curse of Memory in Stochastic Approximation: Extended Version

Stochastic Approximation with Unbounded Markovian Noise: A General-Purpose Theorem

Stochastic Successive Convex Approximation for Non-Convex Constrained Stochastic Optimization

On Stochastic Optimal Control of Partially Observable Nonlinear Quasi Hamiltonian Systems

Stochastic Minimax Vibration Control for Uncertain Nonlinear Quasi-Hamiltonian Systems with Noisy Observations

Time-delay Induced Stochastic Optimization and Extremum Seeking

The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise

Extremely Fast Convergence Rates for Extremum Seeking Control with Polyak-Ruppert Averaging

Stochastic Averaging in Continuous Time and Its Applications to Extremum Seeking

A Revisit to Stochastic Near-Optimal Controls: the Critical Case

A Multilevel Approach for Stochastic Nonlinear Optimal Control

Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data

A Branching Particle System Approximation for Solving Partially Observed Stochastic Optimal Control Problems Via Stochastic Maximum Principle

A Modified Method of Successive Approximations for Stochastic Recursive Optimal Control Problems

Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise

Decentralized Control for Optimal Lq Problems in Stochastic Systems with Unknown Uncertainties

Single-Loop Stochastic Algorithms for Difference of Max-Structured Weakly Convex Functions