Abstract:Portfolio management facilitates trading off risks against returns for multiple financial assets. Reinforcement Learning (RL) is one of the most promising algorithms for portfolio management. However, these state-of-the-art RL algorithms only complete the task of portfolio management, i.e., acquire the different asset features of portfolio, without considering the global context information from portfolio, which leads to non-optimal portfolio representations; Moreover, the corresponding optimizations are implemented using only the loss function in the viewpoint of RL, without considering the relationships between the local asset information and global context embeddings, which leads to non-optimal portfolio policies. To deal with these issues, this paper proposes a Task-Context Mutual Actor–Critic (TC-MAC) algorithm for portfolio management. Specifically, TC-MAC algorithm is developed based on: (1) representation learning introduces a proposed Task-Context (TC) learning algorithm, which not only encodes the task (i.e., acquire different asset features) of portfolio, but also encodes the global dynamic context of portfolio, thus which helps to learn optimal portfolio embeddings; (2) policy learning introduces a proposed Mutual Actor–Critic (MAC) framework, which can measure the relationships between local embedding of each asset and global context embeddings by maximizing mutual information, the corresponding Mutual-Information loss function combines with RL loss function (i.e., Actor–Critic loss) to collectively optimize the whole algorithm, thus which helps to learn optimal portfolio policies. Experimental results on real-world datasets demonstrate the superior performance of TC-MAC algorithm over the well-known traditional portfolio methods and these state-of-the-art RL algorithms, at the same time, show its advantageous transferability.

Continual portfolio selection in dynamic environments via incremental reinforcement learning

A General Framework on Enhancing Portfolio Management with Reinforcement Learning

Continuous‐time mean–variance portfolio selection: A reinforcement learning framework

A Deep Reinforcement Learning Approach for Portfolio Management in Non‐Short‐Selling Market

Reinforcement Learning for Continuous-Time Mean-Variance Portfolio Selection in a Regime-Switching Market

Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization

A Deep Reinforcement Learning Framework For Financial Portfolio Management

Incremental Reinforcement Learning in Continuous Spaces Via Policy Relaxation and Importance Weighting.

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

Portfolio management based on a reinforcement learning framework

Bridging the gap between Markowitz planning and deep reinforcement learning

Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning

Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management

DeepTrader: A Deep Reinforcement Learning Approach for Risk-Return Balanced Portfolio Management with Market Conditions Embedding

Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module

A new deep reinforcement learning model for dynamic portfolio optimization

Lifelong Incremental Reinforcement Learning with Online Bayesian Inference

Deep reinforcement learning for portfolio management

Deep Reinforcement Learning for Stock Portfolio Optimization

MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization

Deep Reinforcement Learning Approach to Portfolio Optimization in the Australian Stock Market