High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning

Uta Pigorsch,Sebastian Schäfer
DOI: https://doi.org/10.48550/arXiv.2112.04755
2021-12-09
Abstract:This paper proposes a Deep Reinforcement Learning algorithm for financial portfolio trading based on Deep Q-learning. The algorithm is capable of trading high-dimensional portfolios from cross-sectional datasets of any size which may include data gaps and non-unique history lengths in the assets. We sequentially set up environments by sampling one asset for each environment while rewarding investments with the resulting asset's return and cash reservation with the average return of the set of assets. This enforces the agent to strategically assign capital to assets that it predicts to perform above-average. We apply our methodology in an out-of-sample analysis to 48 US stock portfolio setups, varying in the number of stocks from ten up to 500 stocks, in the selection criteria and in the level of transaction costs. The algorithm on average outperforms all considered passive and active benchmark investment strategies by a large margin using only one hyperparameter setup for all portfolios.
Portfolio Management,Machine Learning,Computational Finance
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to apply deep reinforcement learning algorithms in high - dimensional stock portfolio trading to achieve better investment returns. Specifically, the author proposes an algorithm based on Deep Q - learning, aiming to overcome the problems of insufficient flexibility and generalization ability in existing methods when dealing with single assets or asset portfolios. The paper focuses on how to optimize investment decisions of financial assets through deep reinforcement learning, especially in the face of a large number of assets, missing data and inconsistent historical lengths, being able to effectively train the model and make it perform well on out - of - sample data. The main contributions of the paper are as follows: 1. **Propose a new deep reinforcement learning algorithm**: This algorithm can use cross - sectional data sets for effective training when dealing with high - dimensional stock portfolios, and at the same time solve the problems of possible data missing and inconsistent historical lengths in the data set. 2. **Design a unique reward mechanism**: Through the reward mechanism, the agent is encouraged to invest when predicting that an asset will perform better than the average level, and choose to hold cash when the prediction is not good, thus promoting the agent to make more favorable investment decisions. 3. **Verify the effectiveness of the algorithm**: Through out - of - sample analysis of 48 US stock portfolios of different scales, it is proved that the proposed algorithm can significantly outperform passive and active benchmark investment strategies under different numbers of stocks, selection criteria and trading cost levels. In general, this paper aims to explore how to use deep reinforcement learning techniques to improve the effectiveness of high - dimensional stock portfolio management, especially in the complex and changeable financial market environment, providing a new solution.