MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX and NDX European Call Option Pricing

Boris Ter-Avanesov,Homayoon Beigi
DOI: https://doi.org/10.13140/RG.2.2.32372.56963
2024-10-14
Abstract:We explore the performance of various artificial neural network architectures, including a multilayer perceptron (MLP), Kolmogorov-Arnold network (KAN), LSTM-GRU hybrid recursive neural network (RNN) models, and a time-delay neural network (TDNN) for pricing European call options. In this study, we attempt to leverage the ability of supervised learning methods, such as ANNs, KANs, and gradient-boosted decision trees, to approximate complex multivariate functions in order to calibrate option prices based on past market data. The motivation for using ANNs and KANs is the Universal Approximation Theorem and Kolmogorov-Arnold Representation Theorem, respectively. Specifically, we use S\&P 500 (SPX) and NASDAQ 100 (NDX) index options traded during 2015-2023 with times to maturity ranging from 15 days to over 4 years (OptionMetrics IvyDB US dataset). Black \& Scholes's (BS) PDE \cite{Black1973} model's performance in pricing the same options compared to real data is used as a benchmark. This model relies on strong assumptions, and it has been observed and discussed in the literature that real data does not match its predictions. Supervised learning methods are widely used as an alternative for calibrating option prices due to some of the limitations of this model. In our experiments, the BS model underperforms compared to all of the others. Also, the best TDNN model outperforms the best MLP model on all error metrics. We implement a simple self-attention mechanism to enhance the RNN models, significantly improving their performance. The best-performing model overall is the LSTM-GRU hybrid RNN model with attention. Also, the KAN model outperforms the TDNN and MLP models. We analyze the performance of all models by ticker, moneyness category, and over/under/correctly-priced percentage.
Computational Finance,Computational Engineering, Finance, and Science,Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to explore the performance of various artificial neural network architectures, including Multilayer Perceptron (MLP), Kolmogorov-Arnold Network (KAN), Long Short-Term Memory-Gated Recurrent Unit Hybrid Recurrent Neural Network (LSTM-GRU Hybrid RNN), and Time Delay Neural Network (TDNN), in pricing European call options. The primary objective of the study is to approximate complex multivariable functions using supervised learning methods (such as ANNs, KANs, and Gradient Boosting Decision Trees) to calibrate option prices based on historical market data. Specifically, the paper uses S&P 500 (SPX) and NASDAQ 100 (NDX) index options traded between 2015 and 2023, with expiration times ranging from 15 days to over 4 years. The paper compares the pricing performance of the Black-Scholes (BS) partial differential equation model with actual data as a benchmark. The BS model relies on strong assumptions, and existing literature has pointed out that actual data does not match its predictions. Therefore, supervised learning methods are widely used as an alternative to calibrate option prices to overcome some limitations of the BS model. Through experiments, the paper finds that the BS model performs worse than all other models. Among them, the best MLP model outperforms the best TDNN model on all error metrics. Additionally, implementing a simple self-attention mechanism significantly improves the performance of the RNN models. The overall best-performing model is the LSTM-GRU Hybrid RNN model with an attention mechanism. The KAN model also outperforms the TDNN and MLP models. The paper also analyzes the performance of all models by stock code, in-the-money/out-of-the-money categories, and the percentage of overestimation/underestimation/correct pricing. Since some errors are complementary (i.e., the percentages of overestimation and underestimation in some in-the-money/out-of-the-money categories are opposite), it may be beneficial to combine the best models.