A Robust Offline Reinforcement Learning Algorithm Based on Behavior Regularization Methods

Yan Zhang,Tianhan Gao,Qingwei Mi
DOI: https://doi.org/10.1109/iaict55358.2022.9887435
2022-01-01
Abstract:Offline deep reinforcement learning algorithms are still in developing. Some existing algorithms have shown that it is feasible to learn directly without using environmental interaction under the condition of sufficient datasets. In this paper, we combine an offline reinforcement learning method through behavior regularization with a robust offline reinforcement learning algorithm. Moreover, the algorithm is verified and analyzed with a high-quality but limited dataset. The experimental results show that it is feasible to combine the behavior regularization method with the robust offline reinforcement learning algorithm, to gain better performance under the condition of limited data compared with the baseline algorithms.
What problem does this paper attempt to address?