Deep Reinforcement Learning for Stock Trading with Behavioral Finance Strategy.

Shilong Deng,Zetao Zheng,Hongcai He,Jie Shao
DOI: https://doi.org/10.1007/978-3-031-46664-9_36
2023-01-01
Abstract:Stock trading is a challenging task and has attracted extensive attention from artificial intelligence researchers. Deep Reinforcement Learning (DRL) approaches, which directly generate trading decisions by maximizing the expected return, have made more breakthroughs than supervised learning approaches. However, most existing DRL approaches ignore essential knowledge in the finance field (e.g., the momentum investment strategy) and fail to generate sophisticated trading decisions. Specifically, the momentum investment strategy, which is popular among professional investors for its profitability, aims to buy stocks that were past winners (with above-average performance) and sell past losers (with below-average performance). Inspired by this concept, we propose Momentum Investment Twin Delay Deep Deterministic (MITD3) policy gradient algorithm based on TD3. At the core of MITD3, a Momentum Investing critic (MI-critic) computes the Q-value according to the state-action pair and advantage historical performance of the actor, and gives a higher Q-value to encourage past winners and a lower value to penalize past losers. In addition, we devise the cross-time module in MITD3, in which the current actor interacts with the environment by receiving past states and making trading decisions, to evaluate the advantage historical performance of the actor. Experimental results on real market data show that our proposed MITD3 outperforms state-of-the-art DRL approaches and generates intuitively explainable stock trading decisions.
What problem does this paper attempt to address?