Abstract:To date, transfer learning (TL) has been successfully applied for enhancing the learning performance of reinforcement learning (RL), and many transfer RL (TRL) approaches have been proposed in the literature. However, most of the existing TRL approaches consider knowledge transfer between RL tasks sharing the same state-action space. These methods thus may fail in cases where the RL tasks available for conducting knowledge transfer possess heterogeneous state-action spaces, which is common in many real-world applications. TRL across heterogeneous problem domains is challenging since the differences lie in the state-action spaces of the RL tasks are natural barriers in the knowledge transfer across tasks. This becomes more difficult if multiple heterogeneous source tasks are available when conducting knowledge transfer for a target RL task, as we have to identify the appropriate source task adaptively before performing knowledge transfer towards enhanced RL performance. In this article, we propose a new TRL algorithm with adaptive policy gradient transfer for the cases having multiple heterogeneous source RL tasks. The core ingredients of the proposed algorithm contain a source task selection module to select an appropriate task from a set of heterogeneous source tasks and a knowledge transfer module for conducting knowledge transfer across heterogeneous RL tasks. To investigate the performance of the proposed algorithm, we have conducted comprehensive empirical studies based on the well-known continuous robotic RL task with heterogeneous settings in the number of robot arms (links). The obtained results show that the proposed algorithm is effective and efficient in conducting knowledge transfer across heterogeneous problems for enhanced RL performance, over both the RL algorithm having no knowledge transfer in the learning process and the existing state-of-the-art TRL method.

Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning

Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review

Shaping in Reinforcement Learning by Knowledge Transferred from Human-Demonstrations of a Simple Similar Task.

Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch

Shaping in Reinforcement Learning Via Knowledge Transferred from Human-Demonstrations

Transferring knowledge from human-demonstration trajectories to reinforcement learning

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function

Efficient Exploration for Multi-Agent Reinforcement Learning Via Transferable Successor Features

Cross Domain Policy Transfer with Effect Cycle-Consistency

Cross-Modal Domain Adaptation for Reinforcement Learning

Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning

A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems

Cross-Domain Policy Transfer by Representation Alignment via Multi-Domain Behavioral Cloning

Robust Knowledge Transfer in Tiered Reinforcement Learning

Transfer Reinforcement Learning in Heterogeneous Action Spaces using Subgoal Mapping

Reinforcement Learning with Adaptive Policy Gradient Transfer Across Heterogeneous Problems

A Framework for Few-Shot Policy Transfer through Observation Mapping and Behavior Cloning

Learning To Walk With Prior Knowledge

Transfer with Action Embeddings for Deep Reinforcement Learning

Research on Isomorphic Task Transfer Algorithm Based on Knowledge Distillation in Multi-Agent Collaborative Systems

Knowledge Transfer Across Modalities with Natural Language Supervision