A Stable Actor-Critic Algorithm for Solving Robotic Tasks with Multiple Constraints

Peiyao Zhao,Fei Zhu,Quan Liu,Xinghong Ling
DOI: https://doi.org/10.1007/s11704-022-1612-9
IF: 2.6688
2022-01-01
Frontiers of Computer Science
Abstract:1 Introduction Deep reinforcement learning has achieved great success especially in game[1]and control areas.Unfortunately,for real world environments that involve more than one objectives.For example,an autonomous car should consider constraints such as the driving speed,energy efficiency,comfort and safety of the passengers[2,3].To solve the problems,Constrained Markov Decision Process(CMDP)was proposed to model tasks with constraints[4,5].
What problem does this paper attempt to address?