Dynamics-Aware Context Representation for Domain Adaptation in Reinforcement Learning

Kai Deng, Baiming Chen, Liang Li
DOI: https://doi.org/10.1109/icrae56463.2022.10056207
2022-01-01
Abstract: A context is an embedding extracted from the historical trajectories of a dynamic system. It provides meaningful information to reinforcement learning (RL) agents and thereby improves the domain adaptability and robustness of RL methods. Context extraction, however, raises two key questions: how can an encoder be trained efficiently to extract context information from historical trajectories, and how can the context be made to distinguish different dynamics clearly? To address these questions, this study proposes dynamics-aware context representation reinforcement learning (DacRL). We use the Cycle-Consistent VAE method to extract a meaningful context from historical trajectories and then divide it into a domain-specific embedding and a domain-general embedding. We further exploit the contrastive nature of different tasks to improve the quality of the domain-specific embedding, so that it represents the dynamics more clearly. Finally, the current state is combined with the domain-specific embedding and passed to the RL agent to improve its generalization. Simulation results show that the proposed DacRL outperforms the baseline methods.
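The data flow described in the abstract (split the context, contrast the domain-specific parts, append the domain-specific part to the state) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the loss or the architecture, so the InfoNCE-style loss, the fixed split index, and every function name below are assumptions made for illustration. The Cycle-Consistent VAE encoder is not implemented; contexts are given as plain vectors.

```python
import math

def split_context(context, specific_dim):
    """Split a context vector into (domain_specific, domain_general).
    Assumption: the first `specific_dim` entries are the domain-specific part."""
    return context[:specific_dim], context[specific_dim:]

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def info_nce(anchor, positive, negatives, temperature=0.1):
    """InfoNCE-style contrastive loss (an assumed stand-in, not the paper's loss).
    Small when the anchor is closer to the positive than to the negatives."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))

def policy_input(state, domain_specific):
    """Concatenate the current state with the domain-specific embedding."""
    return list(state) + list(domain_specific)

# Hypothetical contexts: two trajectories from domain A, one from domain B.
ctx_a1 = [1.0, 0.0, 0.3]
ctx_a2 = [0.9, 0.1, 0.7]
ctx_b = [0.0, 1.0, 0.5]

spec_a1, _ = split_context(ctx_a1, specific_dim=2)
spec_a2, _ = split_context(ctx_a2, specific_dim=2)
spec_b, _ = split_context(ctx_b, specific_dim=2)

# Same-domain pair as positive gives a small loss; a cross-domain pair does not.
loss_separated = info_nce(spec_a1, spec_a2, [spec_b])
loss_confused = info_nce(spec_a1, spec_b, [spec_a2])

obs = policy_input([0.2, -0.4], spec_a1)
```

In this toy setup only the domain-specific part enters the contrastive loss and the policy input, which mirrors the abstract's statement that the domain-specific information is what is delivered to the RL agent.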