Conservative Predictions on Noisy Financial Data

Omkar Nabar, Gautam Shroff
DOI: https://doi.org/10.1145/3604237.3626859
2023-10-18
Abstract: Price movements in financial markets are well known to be very noisy. As a result, even if there are, on occasion, exploitable patterns that could be picked up by machine-learning algorithms, these are obscured by feature and label noise, rendering the predictions less useful and risky in practice. Traditional rule-learning techniques developed for noisy data, such as CN2, would seek only high-precision rules and refrain from making predictions where their antecedents did not apply. We apply a similar approach, where a model abstains from making a prediction on data points that it is uncertain about. During training, a cascade of such models is learned in sequence, similar to rule lists, with each model being trained only on data on which the previous model(s) were uncertain. Similar pruning of data takes place at test time, with (higher accuracy) predictions being made, albeit only on a fraction (support) of test-time data. In a financial prediction setting, such an approach allows decisions to be taken only when the ensemble model is confident, thereby reducing risk. We present results using traditional MLPs as well as differentiable decision trees, on synthetic data as well as real financial market data, to predict fixed-term returns using commonly used features. We submit that our approach is likely to result in better overall returns at a lower level of risk. In this context we introduce a utility metric to measure the average gain per trade, as well as the return adjusted for downside risk, both of which are improved significantly by our approach.
Machine Learning, Artificial Intelligence, Computational Engineering, Finance, and Science
What problem does this paper attempt to address?
This paper addresses prediction in financial markets, and specifically how to make robust predictions when the data is extremely noisy (i.e., dominated by random fluctuations and market uncertainty). Because price movements are so noisy, any patterns that machine-learning algorithms could in principle capture are masked by feature and label noise, which makes predictions inaccurate and risky to act on. To address this, the authors propose a conservative prediction method that uses a cascade of models to improve prediction accuracy and reduce risk. The core ideas are:

1. **Selective Prediction**: The model makes predictions only on data points for which it is sufficiently confident, and abstains on uncertain ones. This protects prediction quality and avoids errors caused by noise.
2. **Cascading Model**: A series of models is trained in sequence, with each model trained only on the data about which the previous model(s) were uncertain. At test time, a data point is passed to each model in turn until one of them is confident enough to predict; if none is, no prediction is made. This can increase the proportion of data on which a prediction is made (i.e., the support) while maintaining high precision.
3. **Practicality and Risk Adjustment**: Two evaluation metrics are introduced: a utility metric that measures the average gain per trade, and the return adjusted for downside risk. These metrics help evaluate how the model would perform in actual trading, especially its prediction ability in extreme market situations.

In summary, the paper aims to develop a financial prediction model that gives more reliable and useful predictions in a noisy environment, helping investors make better decisions in an uncertain market and reduce investment risk.
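The cascade idea can be illustrated with a minimal Python sketch. This is not the authors' implementation: the paper uses MLPs and differentiable decision trees, whereas the sketch substitutes a tiny logistic regression as a stand-in. The confidence rule (maximum class probability compared with a threshold `tau`), the default `tau=0.8`, and all function names (`fit_logistic`, `train_cascade`, `predict_cascade`, `evaluate`, `make_toy_data`) are assumptions made for illustration only.

```python
import math
import random


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(X, y, epochs=200, lr=0.1):
    """Stand-in base model (assumption): logistic regression trained by SGD."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            g = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            w = [wj - lr * g * xj for wj, xj in zip(w, xi)]
            b -= lr * g
    # Returns a function mapping a feature vector to P(label == 1).
    return lambda x: sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def confidence(p):
    """Assumed confidence measure: probability of the more likely class."""
    return max(p, 1.0 - p)


def train_cascade(X, y, n_levels=3, tau=0.8):
    """Train models in sequence; each level sees only the points
    on which the previous level's confidence fell below tau."""
    models = []
    for _ in range(n_levels):
        if len(set(y)) < 2:  # nothing left, or only one class remains
            break
        model = fit_logistic(X, y)
        models.append(model)
        keep = [i for i, x in enumerate(X) if confidence(model(x)) < tau]
        X, y = [X[i] for i in keep], [y[i] for i in keep]
    return models


def predict_cascade(models, x, tau=0.8):
    """Return 0 or 1 from the first confident level, or None to abstain."""
    for model in models:
        p = model(x)
        if confidence(p) >= tau:
            return 1 if p >= 0.5 else 0
    return None


def evaluate(models, X, y, tau=0.8):
    """Support = fraction of points predicted; accuracy is on those points only."""
    preds = [predict_cascade(models, x, tau) for x in X]
    covered = [(p, t) for p, t in zip(preds, y) if p is not None]
    support = len(covered) / len(X)
    accuracy = sum(p == t for p, t in covered) / len(covered) if covered else None
    return support, accuracy


def make_toy_data(n=200, seed=0):
    """Hypothetical synthetic data: label depends on x0, with noise near the boundary."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x0, x1 = rng.uniform(-1, 1), rng.uniform(-1, 1)
        label = 1 if x0 > 0 else 0
        if abs(x0) < 0.3 and rng.random() < 0.3:
            label = 1 - label  # flip some labels close to the boundary
        X.append([x0, x1])
        y.append(label)
    return X, y
```

Calling `evaluate(train_cascade(X, y), X, y)` on data from `make_toy_data()` returns a `(support, accuracy)` pair. Raising `tau` would be expected to lower support, since more points are left unpredicted. The paper's own confidence measure, its utility metric, and its downside-risk-adjusted return are not reproduced here.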