Asynchronous Block Parallel Policy Optimization for the Linear Quadratic Regulator

Xingyu Sha, Keyou You
DOI: https://doi.org/10.23919/acc60939.2024.10645010
2024-01-01
Abstract: Although policy optimization (PO) is widely regarded as an essential approach in reinforcement learning, its theoretical understanding still lags behind, since the underlying problems are usually non-convex. In this work, we study the convergence of the PO method for the linear quadratic regulator (LQR) problem under asynchronous block parallel policy updates. Specifically, a group of agents jointly computes the policy, and each agent is responsible only for updating one block of the policy by communicating asynchronously with a central coordinator. We rigorously prove that the method converges linearly to the optimal policy. Numerical results validate the performance of our method.
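The update scheme described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' algorithm, whose details the abstract does not give. It assumes a discrete-time system x_{t+1} = A x_t + B u_t with u_t = -K x_t, uses the standard model-based LQR policy gradient 2[(R + B^T P_K B)K - B^T P_K A]Σ_K, partitions K into row blocks (one per agent), and models asynchrony by letting each agent compute its gradient from a copy of K that is up to `max_delay` iterations old. The function names, the example system, the step size, and the delay model are all hypothetical choices made for illustration.

```python
import numpy as np

def dlyap(M, W):
    """Solve X = M X M^T + W by vectorization (adequate for small systems)."""
    n = M.shape[0]
    x = np.linalg.solve(np.eye(n * n) - np.kron(M, M), W.reshape(-1))
    return x.reshape(n, n)

def lqr_cost_and_grad(K, A, B, Q, R, Sigma0):
    """Return J(K) = tr(P_K Sigma0) and its gradient for the policy u = -K x."""
    Acl = A - B @ K
    if np.max(np.abs(np.linalg.eigvals(Acl))) >= 1.0:
        raise ValueError("K is not stabilizing")
    P = dlyap(Acl.T, Q + K.T @ R @ K)   # cost-to-go matrix P_K
    Sigma = dlyap(Acl, Sigma0)          # accumulated state correlation
    E = (R + B.T @ P @ B) @ K - B.T @ P @ A
    return np.trace(P @ Sigma0), 2.0 * E @ Sigma

def riccati_gain(A, B, Q, R, iters=1000):
    """Reference optimal gain from fixed-point iteration of the Riccati map."""
    P = Q.copy()
    for _ in range(iters):
        G = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ A - A.T @ P @ B @ G
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

def async_block_pg(K0, blocks, A, B, Q, R, Sigma0,
                   step=0.005, iters=4000, max_delay=3, seed=0):
    """Simulated asynchronous block updates (assumed delay model).

    At each iteration one randomly chosen agent reports a gradient computed
    from a stale copy of K, and the coordinator updates only that agent's rows.
    """
    rng = np.random.default_rng(seed)
    K = K0.copy()
    history = [K.copy()]                 # recent iterates held by the coordinator
    for t in range(iters):
        agent = rng.integers(len(blocks))
        delay = rng.integers(0, min(t, max_delay) + 1)
        K_stale = history[-1 - delay]    # the agent's outdated view of K
        _, grad = lqr_cost_and_grad(K_stale, A, B, Q, R, Sigma0)
        rows = blocks[agent]
        K = K.copy()
        K[rows, :] -= step * grad[rows, :]
        history.append(K)
        history = history[-(max_delay + 1):]
    return K

# Hypothetical example: open-loop stable A, so K0 = 0 is stabilizing.
A = np.array([[0.7, 0.2], [0.1, 0.6]])
B = np.eye(2)
Q = np.eye(2)
R = np.eye(2)
Sigma0 = np.eye(2)
K0 = np.zeros((2, 2))
blocks = [[0], [1]]                      # agent i owns row i of K

K = async_block_pg(K0, blocks, A, B, Q, R, Sigma0)
K_star = riccati_gain(A, B, Q, R)
J0, _ = lqr_cost_and_grad(K0, A, B, Q, R, Sigma0)
J_final, _ = lqr_cost_and_grad(K, A, B, Q, R, Sigma0)
print("distance to Riccati gain:", np.linalg.norm(K - K_star))
print("cost before and after:", J0, J_final)
```

The step size is deliberately small relative to the delay bound so that stale gradients cannot push the iterate out of the stabilizing set in this example; a larger system or longer delays would likely need a different choice.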