Enhanced Momentum with Momentum Transformers

Max Mason, Waasi A Jagirdar, David Huang, Rahul Murugan
2024-12-17
Abstract: The primary objective of this research is to build a Momentum Transformer that is expected to outperform benchmark time-series momentum and mean-reversion trading strategies. We extend the ideas introduced in the paper "Trading with the Momentum Transformer: An Intelligent and Interpretable Architecture" to equities, as the original paper focuses primarily on futures and equity indices. Unlike conventional Long Short-Term Memory (LSTM) models, which operate sequentially and are optimized for processing local patterns, an attention mechanism equips our architecture with direct access to all prior time steps in the training window. This hybrid design, combining attention with an LSTM, enables the model to capture long-term dependencies, enhance performance in scenarios accounting for transaction costs, and seamlessly adapt to evolving market conditions, such as those witnessed during the COVID-19 pandemic. We average 4.14% returns, which is similar to the original paper's results. Our Sharpe ratio is lower, at an average of 1.12, because of much higher volatility, which may reflect stocks being inherently more volatile than futures and indices.
Computational Finance, Machine Learning
What problem does this paper attempt to address?
This paper aims to improve on traditional momentum and mean-reversion trading strategies by constructing a model named the **Momentum Transformer**. Specifically, the research aims to:

1. **Improve the performance of trading strategies**: By introducing the Transformer architecture from deep learning, the model is better able to capture long-term dependencies in time-series data, which is intended to improve both the returns and the risk-adjusted performance of trading strategies.
2. **Adapt to market changes**: Traditional momentum strategies may become ineffective under rapidly changing market conditions, especially when facing non-stationarity or momentum turning points (such as during the COVID-19 pandemic). The paper attempts to make the model adapt better to these changes by combining the attention mechanism with an LSTM.
3. **Apply to the stock market**: Previous research has mainly focused on futures and indices. This research extends the Deep Momentum Networks method to the stock market to verify its applicability and effectiveness in an asset class that is more accessible and has lower barriers to trading.
4. **Address common problems in quantitative investment**: The paper notes that several common challenges in quantitative investment were encountered during implementation, such as data leakage, look-ahead bias, and survivorship bias. These problems need particular attention in practical applications.

In summary, the main objective of this paper is to develop a new momentum trading model, the Momentum Transformer, in the hope of achieving better trading performance in the stock market and overcoming the limitations of existing methods in complex market environments.
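
The look-ahead bias named in point 4 and the Sharpe ratio quoted in the abstract can both be made concrete with a small sketch. The code below is not the paper's model or pipeline. It is a minimal illustration of a benchmark time-series momentum rule, under these assumptions: the function names (`tsmom_positions`, `strategy_returns`, `annualized_sharpe`), the two-day lookback, the 252 trading days per year, a zero risk-free rate, and the toy price series are all hypothetical choices made for the example. The key point is that the position held over day `t` is computed only from prices up to day `t - 1`.

```python
import math
from statistics import mean, stdev


def tsmom_positions(prices, lookback):
    """Sign-of-past-return positions with no look-ahead.

    positions[t] is the position held from day t-1 to day t, and it is
    decided using prices up to and including day t-1 only.
    """
    positions = [0] * len(prices)
    for t in range(lookback + 1, len(prices)):
        past_return = prices[t - 1] / prices[t - 1 - lookback] - 1.0
        if past_return > 0:
            positions[t] = 1      # long after a positive trailing return
        elif past_return < 0:
            positions[t] = -1     # short after a negative trailing return
    return positions


def strategy_returns(prices, positions):
    """Daily strategy returns: position for day t times the day-t return."""
    returns = []
    for t in range(1, len(prices)):
        daily_return = prices[t] / prices[t - 1] - 1.0
        returns.append(positions[t] * daily_return)
    return returns


def annualized_sharpe(returns, periods_per_year=252):
    """Annualized Sharpe ratio, assuming a zero risk-free rate."""
    volatility = stdev(returns)
    if volatility == 0:
        return 0.0
    return mean(returns) / volatility * math.sqrt(periods_per_year)


# Toy price series (hypothetical data, for illustration only).
prices = [100, 101, 102, 103, 104, 103, 102, 101, 100, 99]
positions = tsmom_positions(prices, lookback=2)
returns = strategy_returns(prices, positions)
print(positions)
print(annualized_sharpe(returns))
```

A quick check for look-ahead bias is to change the final price and confirm that no position changes, since no position should depend on a price that was not yet known. The `annualized_sharpe` helper also shows why similar average returns can come with a lower Sharpe ratio: for a fixed mean return, a larger standard deviation shrinks the ratio.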