Abstract: Visual reinforcement learning is important in practical applications such as video games, robotic manipulation, and autonomous navigation. A major challenge, however, is generalization to unseen environments, that is, how agents handle environments with previously unseen backgrounds. This issue stems mainly from the high unpredictability inherent in the high-dimensional observation space. Techniques such as domain randomization and data augmentation have been explored to address it, but they still do not achieve satisfactory results. This paper proposes a new method, Internal States Simulation Auxiliary (ISSA), which uses internal states to improve generalization in visual reinforcement learning tasks. The method involves two agents, a teacher agent and a student agent: the teacher agent can directly access the environment's internal states and is used to facilitate the student agent's training; the student agent receives initial guidance from the teacher agent and then continues to learn independently. Viewed another way, the method consists of two phases: a transfer learning phase and a traditional visual reinforcement learning phase. In the first phase, the teacher agent interacts with environments and imparts knowledge to the vision-based student agent. Guided by the teacher agent, the student agent discovers more effective visual representations that address the high unpredictability of the high-dimensional observation space. In the second phase, the student agent learns autonomously from the visual information in the environment and ultimately becomes a vision-based reinforcement learning agent with enhanced generalization. The method is evaluated on the DMControl Generalization Benchmark and on DrawerWorld with texture distortions. Preliminary results indicate that it significantly improves generalization ability and performance in complex continuous control tasks.
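
The two-phase schedule described in the abstract can be illustrated with a deliberately tiny toy sketch. This is not the ISSA implementation: the one-dimensional environment, the linear student policy, the least-mean-squares imitation update used for the transfer phase, and the random hill-climbing used for the independent phase are all assumptions chosen to keep the example short, and every function name below is hypothetical.

```python
import random

def teacher_action(state):
    """Privileged teacher: reads the internal state and steers it to zero."""
    return -state

def observe(state, rng):
    """Toy stand-in for an image: the task signal plus a random distractor."""
    return (state, rng.uniform(-1.0, 1.0))

def student_action(weights, obs):
    """Linear student policy that only sees the observation."""
    return weights[0] * obs[0] + weights[1] * obs[1]

def evaluate(weights, episodes=200):
    """Average reward -(state + action)^2 on a fixed, seeded evaluation set."""
    rng = random.Random(12345)
    total = 0.0
    for _ in range(episodes):
        state = rng.uniform(-1.0, 1.0)
        action = student_action(weights, observe(state, rng))
        total += -(state + action) ** 2
    return total / episodes

def transfer_phase(weights, rng, steps=50, lr=0.05):
    """Phase 1 (assumed form): the student regresses onto the teacher's actions."""
    w = list(weights)
    for _ in range(steps):
        state = rng.uniform(-1.0, 1.0)
        obs = observe(state, rng)
        error = student_action(w, obs) - teacher_action(state)
        # Gradient step on the squared imitation error.
        w[0] -= lr * error * obs[0]
        w[1] -= lr * error * obs[1]
    return w

def independent_phase(weights, rng, iterations=100, sigma=0.05):
    """Phase 2 (assumed form): reward-only hill climbing, no teacher involved."""
    best = list(weights)
    best_score = evaluate(best)
    for _ in range(iterations):
        candidate = [w + rng.gauss(0.0, sigma) for w in best]
        score = evaluate(candidate)
        if score > best_score:  # keep a perturbation only if reward improves
            best, best_score = candidate, score
    return best

def run_two_phase_training(seed=0):
    """Return average reward before training, after phase 1, and after phase 2."""
    rng = random.Random(seed)
    weights = [0.0, 0.0]
    initial = evaluate(weights)
    weights = transfer_phase(weights, rng)
    after_transfer = evaluate(weights)
    weights = independent_phase(weights, rng)
    after_independent = evaluate(weights)
    return initial, after_transfer, after_independent

if __name__ == "__main__":
    print(run_two_phase_training())
```

In this toy setting the teacher is consulted only during `transfer_phase`; `independent_phase` relies on reward alone, mirroring the hand-off from guided to autonomous learning that the abstract describes.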

Learning Task-relevant Representations for Generalization Via Characteristic Functions of Reward Sequence Distributions

Generalization in Visual Reinforcement Learning with the Reward Sequence Distribution

Learning Robust Representation for Reinforcement Learning with Distractions by Reward Sequence Prediction

Reinforcement Learning with Generalizable Gaussian Splatting

Recovering Permuted Sequential Features for effective Reinforcement Learning

Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations

Sequential Action-Induced Invariant Representation for Reinforcement Learning

Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence

Learning explainable task-relevant state representation for model-free deep reinforcement learning

Learning Controllable Elements Oriented Representations for Reinforcement Learning

Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions

Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning

Task-Induced Representation Learning

Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications

Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning

Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning

RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization

RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability

Generalization Enhancement of Visual Reinforcement Learning through Internal States

Normalization Enhances Generalization in Visual Reinforcement Learning

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning