Concentration bounds for two time scale stochastic approximation

Vivek S. Borkar,Sarath Pattathil
DOI: https://doi.org/10.48550/arXiv.1806.10798
2018-06-28
Abstract:Viewing a two time scale stochastic approximation scheme as a noisy discretization of a singularly perturbed differential equation, we obtain a concentration bound for its iterates that captures its behavior with quantifiable high probability. This uses Alekseev's nonlinear variation of constants formula and a martingale concentration inequality and extends the corresponding results for single time scale stochastic approximation.
Optimization and Control
What problem does this paper attempt to address?