Abstract:Portfolio management facilitates trading off risks against returns for multiple financial assets. Reinforcement Learning (RL) is one of the most promising algorithms for portfolio management. However, these state-of-the-art RL algorithms only complete the task of portfolio management, i.e., acquire the different asset features of portfolio, without considering the global context information from portfolio, which leads to non-optimal portfolio representations; Moreover, the corresponding optimizations are implemented using only the loss function in the viewpoint of RL, without considering the relationships between the local asset information and global context embeddings, which leads to non-optimal portfolio policies. To deal with these issues, this paper proposes a Task-Context Mutual Actor–Critic (TC-MAC) algorithm for portfolio management. Specifically, TC-MAC algorithm is developed based on: (1) representation learning introduces a proposed Task-Context (TC) learning algorithm, which not only encodes the task (i.e., acquire different asset features) of portfolio, but also encodes the global dynamic context of portfolio, thus which helps to learn optimal portfolio embeddings; (2) policy learning introduces a proposed Mutual Actor–Critic (MAC) framework, which can measure the relationships between local embedding of each asset and global context embeddings by maximizing mutual information, the corresponding Mutual-Information loss function combines with RL loss function (i.e., Actor–Critic loss) to collectively optimize the whole algorithm, thus which helps to learn optimal portfolio policies. Experimental results on real-world datasets demonstrate the superior performance of TC-MAC algorithm over the well-known traditional portfolio methods and these state-of-the-art RL algorithms, at the same time, show its advantageous transferability.

Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey

Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization

Model-Free Reinforcement Learning for Asset Allocation

Uncertainty-Aware Reinforcement Learning for Portfolio Optimization

Recent Advances in Reinforcement Learning in Finance

A General Framework on Enhancing Portfolio Management with Reinforcement Learning

Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

A Deep Reinforcement Learning Framework For Financial Portfolio Management

A Review of Reinforcement Learning in Financial Applications

Reinforcement Learning Portfolio Manager Framework with Monte Carlo Simulation

A Two-Level Reinforcement Learning Algorithm for Ambiguous Mean-Variance Portfolio Selection Problem

Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment

Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting

Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module

Deep reinforcement learning for portfolio management

Deep Reinforcement Learning for Asset Allocation in US Equities

Bridging the gap between Markowitz planning and deep reinforcement learning

From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization

Reinforcement Learning-Based Multimodal Model for the Stock Investment Portfolio Management Task

A Deep Reinforcement Learning Approach for Portfolio Management in Non‐Short‐Selling Market