Abstract: To date, transfer learning (TL) has been successfully applied to enhance the learning performance of reinforcement learning (RL), and many transfer RL (TRL) approaches have been proposed in the literature. However, most existing TRL approaches consider knowledge transfer between RL tasks that share the same state-action space. These methods may therefore fail when the RL tasks available for knowledge transfer have heterogeneous state-action spaces, which is common in many real-world applications. TRL across heterogeneous problem domains is challenging because the differences in the state-action spaces of the RL tasks are natural barriers to knowledge transfer across tasks. The problem becomes more difficult when multiple heterogeneous source tasks are available for a target RL task, since the appropriate source task must be identified adaptively before knowledge transfer can enhance RL performance. In this article, we propose a new TRL algorithm with adaptive policy gradient transfer for cases with multiple heterogeneous source RL tasks. The core components of the proposed algorithm are a source task selection module, which selects an appropriate task from a set of heterogeneous source tasks, and a knowledge transfer module, which conducts knowledge transfer across heterogeneous RL tasks. To investigate the performance of the proposed algorithm, we conducted comprehensive empirical studies on the well-known continuous robotic RL task with heterogeneous settings in the number of robot arms (links). The results show that the proposed algorithm is effective and efficient in conducting knowledge transfer across heterogeneous problems, improving RL performance over both the RL algorithm without knowledge transfer and the existing state-of-the-art TRL method.

Cross-domain adaptive transfer reinforcement learning based on state-action correspondence

Reinforcement Learning with Adaptive Policy Gradient Transfer Across Heterogeneous Problems

Efficient Deep Reinforcement Learning Via Adaptive Policy Transfer

Deep Reinforcement Learning for Autonomous Driving by Transferring Visual Features

Efficient Deep Reinforcement Learning Through Policy Transfer

Transferring knowledge from human-demonstration trajectories to reinforcement learning

Skill based transfer learning with domain adaptation for continuous reinforcement learning domains

Transfer with Action Embeddings for Deep Reinforcement Learning

A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning

Task and Domain Adaptive Reinforcement Learning for Robot Control

An advantage based policy transfer algorithm for reinforcement learning with measures of transferability

Domain Adaptive State Representation Alignment for Reinforcement Learning

Cross-Domain Communications Between Agents Via Adversarial-Based Domain Adaptation in Reinforcement Learning

Shaping in Reinforcement Learning Via Knowledge Transferred from Human-Demonstrations

Cross Domain Policy Transfer with Effect Cycle-Consistency

Learning Action-Transferable Policy with Action Embedding

Cross-Modal Domain Adaptation for Reinforcement Learning

Cross-domain policy adaptation with dynamics alignment

Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Lateral Transfer Learning for Multiagent Reinforcement Learning