Numin: Weighted-Majority Ensembles for Intraday Trading

Aniruddha Mukherjee,Rekha Singhal,Gautam Shroff
DOI: https://doi.org/10.1145/3677052.3698656
2024-12-04
Abstract:We consider the application of machine learning models for short-term intra-day trading in equities. We envisage a scenario wherein machine learning models are submitted by independent data scientists to predict discretised ten-candle returns every five minutes, in response to five-minute candlestick data provided to them in near real-time. An ensemble model combines these multiple models via a weighted-majority algorithm. The weights of each model are dynamically updated based on the performance of each model, and can also be used to reward model owners. Each model's performance is evaluated according to two different metrics over a recent time window: In addition to accuracy, we also consider a `utility' metric that is a proxy for a model's potential profitability under a particular trading strategy. We present experimental results on real intra-day data that show that our weighted-majority ensemble techniques show improved accuracy as well as utility over any of the individual models, especially using the utility metric to dynamically re-weight models over shorter time-windows.
Computational Engineering, Finance, and Science
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve the challenges of stock prediction in short - term intraday trading, especially how to improve the accuracy and potential profitability of prediction through the ensemble learning method. Specifically: 1. **Challenges in intraday trading**: In a high - frequency trading environment, the latest K - line data (i.e., opening price, highest price, lowest price, closing price, and trading volume) is provided every 5 minutes, and it is required to predict the return rate of the next 10 K - lines in real - time. Due to the intense market fluctuations and large amount of information, it is difficult for a single model to maintain high accuracy continuously. 2. **Application of ensemble learning**: To overcome the limitations of a single model, the author proposes a framework named Numin, which combines prediction models submitted by multiple independent data scientists using the Weighted - Majority Algorithm (WMA). These models adjust their weights dynamically based on historical performance to ensure that the most reliable predictions are given higher weights. 3. **Dynamic weight adjustment**: The weight of each model is updated dynamically according to its performance in the past period. The weight is not only based on accuracy but also introduces a "utility" metric, which is a proxy for the potential profitability of the model under a specific trading strategy. In this way, the system can quickly adapt to market changes and identify the best - performing models. 4. **Experimental verification**: The author conducted experiments using real - world intraday trading data, and the results show that, compared with a single model, the weighted - majority ensemble method has a significant improvement in both accuracy and utility, especially when the utility metric is used to adjust the weights dynamically in a shorter time window. ### Formula representation - **Weight update formula**: \[ w_j^{(r)}=\alpha\cdot\tilde{s}_j^{(r)}+(1 - \alpha)\cdot w_j^{(r - 1)} \] where \(\alpha=\frac{2}{|win|+ 1}\), \(\tilde{s}_j^{(r)}\) is the normalized score, and \(w_j^{(r)}\) is the weight of model \(j\) after the \(r\)-th round. - **Utility metric**: \[ U=\sum_{t = 1}^{m}(\text{actual return}-\text{expected return}) \] The utility metric evaluates the profitability of the model in actual trading. Through these methods, the Numin framework can take advantage of multiple models in a constantly changing market environment and provide more accurate and more profitable trading suggestions.