Abstract:To date, transfer learning (TL) has been successfully applied for enhancing the learning performance of reinforcement learning (RL), and many transfer RL (TRL) approaches have been proposed in the literature. However, most of the existing TRL approaches consider knowledge transfer between RL tasks sharing the same state-action space. These methods thus may fail in cases where the RL tasks available for conducting knowledge transfer possess heterogeneous state-action spaces, which is common in many real-world applications. TRL across heterogeneous problem domains is challenging since the differences lie in the state-action spaces of the RL tasks are natural barriers in the knowledge transfer across tasks. This becomes more difficult if multiple heterogeneous source tasks are available when conducting knowledge transfer for a target RL task, as we have to identify the appropriate source task adaptively before performing knowledge transfer towards enhanced RL performance. In this article, we propose a new TRL algorithm with adaptive policy gradient transfer for the cases having multiple heterogeneous source RL tasks. The core ingredients of the proposed algorithm contain a source task selection module to select an appropriate task from a set of heterogeneous source tasks and a knowledge transfer module for conducting knowledge transfer across heterogeneous RL tasks. To investigate the performance of the proposed algorithm, we have conducted comprehensive empirical studies based on the well-known continuous robotic RL task with heterogeneous settings in the number of robot arms (links). The obtained results show that the proposed algorithm is effective and efficient in conducting knowledge transfer across heterogeneous problems for enhanced RL performance, over both the RL algorithm having no knowledge transfer in the learning process and the existing state-of-the-art TRL method.

Robust Knowledge Transfer in Tiered Reinforcement Learning

TRCC: Transferable Congestion Control with Reinforcement Learning

Transferring knowledge from human-demonstration trajectories to reinforcement learning

Shaping in Reinforcement Learning Via Knowledge Transferred from Human-Demonstrations

KnowRU: Knowledge Reusing via Knowledge Distillation in Multi-agent Reinforcement Learning

Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch

Transfer Learning Algorithm with Knowledge Division Level

KnowRU: Knowledge Reuse Via Knowledge Distillation in Multi-Agent Reinforcement Learning

Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning

Reinforcement Learning with Adaptive Policy Gradient Transfer Across Heterogeneous Problems

Doubly Robust Augmented Transfer for Meta-Reinforcement Learning.

Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities

Reusing Source Task Knowledge Via Transfer Approximator in Reinforcement Transfer Learning

Decoupling Dynamics and Reward for Transfer Learning

Strategy Selection In Complex Game Environments Based On Transfer Reinforcement Learning

Transfer of Temporal Logic Formulas in Reinforcement Learning

DGTRL: Deep graph transfer reinforcement learning method based on fusion of knowledge and data

Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach

Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from multiple sources in the same domain

Deep Q-learning with Explainable and Transferable Domain Rules.

An advantage based policy transfer algorithm for reinforcement learning with measures of transferability