MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX and NDX European Call Option Pricing

Boris Ter-Avanesov, Homayoon Beigi

DOI: https://doi.org/10.13140/RG.2.2.32372.56963

2024-10-14

Abstract: We explore the performance of various artificial neural network architectures, including a multilayer perceptron (MLP), Kolmogorov-Arnold network (KAN), LSTM-GRU hybrid recurrent neural network (RNN) models, and a time-delay neural network (TDNN) for pricing European call options. In this study, we attempt to leverage the ability of supervised learning methods, such as ANNs, KANs, and gradient-boosted decision trees, to approximate complex multivariate functions in order to calibrate option prices based on past market data. The motivation for using ANNs and KANs is the Universal Approximation Theorem and the Kolmogorov-Arnold Representation Theorem, respectively. Specifically, we use S&P 500 (SPX) and NASDAQ 100 (NDX) index options traded during 2015-2023 with times to maturity ranging from 15 days to over 4 years (OptionMetrics IvyDB US dataset). The performance of the Black-Scholes (BS) PDE model (Black and Scholes, 1973) in pricing the same options, compared with real data, is used as a benchmark. This model relies on strong assumptions, and it has been observed and discussed in the literature that real data does not match its predictions. Because of these limitations, supervised learning methods are widely used as an alternative for calibrating option prices. In our experiments, the BS model underperforms all of the other models. The best TDNN model outperforms the best MLP model on all error metrics, and the KAN model outperforms both the TDNN and MLP models. We implement a simple self-attention mechanism to enhance the RNN models, significantly improving their performance. The best-performing model overall is the LSTM-GRU hybrid RNN model with attention. We analyze the performance of all models by ticker, moneyness category, and over/under/correctly-priced percentage.
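
As a point of reference for the BS benchmark mentioned in the abstract, the closed-form European call price can be sketched as follows. This is a minimal illustration and not the authors' code: the function and parameter names are hypothetical, the continuous dividend yield `div_yield` is an added assumption, and the abstract does not say which volatility input the benchmark uses.

```python
import math


def norm_cdf(x: float) -> float:
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call_price(spot: float, strike: float, tau: float,
                  rate: float, sigma: float, div_yield: float = 0.0) -> float:
    """Black-Scholes price of a European call.

    spot, strike : underlying level and strike price
    tau          : time to maturity in years
    rate         : continuously compounded risk-free rate
    sigma        : annualized volatility (assumed input; source not specified)
    div_yield    : continuous dividend yield (assumption, defaults to 0)
    """
    if tau <= 0.0:
        # At expiry the call is worth its intrinsic value.
        return max(spot - strike, 0.0)
    vol_sqrt_tau = sigma * math.sqrt(tau)
    d1 = (math.log(spot / strike)
          + (rate - div_yield + 0.5 * sigma ** 2) * tau) / vol_sqrt_tau
    d2 = d1 - vol_sqrt_tau
    return (spot * math.exp(-div_yield * tau) * norm_cdf(d1)
            - strike * math.exp(-rate * tau) * norm_cdf(d2))


# Illustrative inputs only (not taken from the paper's dataset).
example_price = bs_call_price(spot=100.0, strike=100.0, tau=1.0,
                              rate=0.05, sigma=0.2)
```

With these illustrative inputs the function returns roughly 10.45. A benchmark of this kind would be evaluated against observed market prices with the same error metrics as the learned models.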

Computational Finance; Computational Engineering, Finance, and Science; Machine Learning

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve

This paper explores how well several artificial neural network architectures price European call options: a Multilayer Perceptron (MLP), a Kolmogorov-Arnold Network (KAN), Long Short-Term Memory/Gated Recurrent Unit hybrid Recurrent Neural Networks (LSTM-GRU hybrid RNNs), and a Time-Delay Neural Network (TDNN). The primary objective is to use supervised learning methods (ANNs, KANs, and gradient-boosted decision trees) to approximate complex multivariate functions and thereby calibrate option prices from historical market data. Specifically, the paper uses S&P 500 (SPX) and NASDAQ 100 (NDX) index options traded between 2015 and 2023, with times to maturity ranging from 15 days to over 4 years.

The pricing performance of the Black-Scholes (BS) partial differential equation model, measured against actual market data, serves as the benchmark. The BS model relies on strong assumptions, and the existing literature has pointed out that actual data does not match its predictions. Supervised learning methods are therefore widely used as an alternative way to calibrate option prices and overcome some of the BS model's limitations.

In the experiments, the BS model performs worse than all of the other models. The best TDNN model outperforms the best MLP model on all error metrics, and the KAN model outperforms both the TDNN and MLP models. Adding a simple self-attention mechanism significantly improves the performance of the RNN models, and the best-performing model overall is the LSTM-GRU hybrid RNN with attention.

The paper also analyzes the performance of all models by ticker, by moneyness category, and by the percentage of overpriced, underpriced, and correctly priced options. Since some errors are complementary (that is, in some moneyness categories one model tends to overprice where another tends to underprice), it may be beneficial to combine the best models.
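
The over/under/correctly-priced breakdown and the idea of combining complementary models can be sketched in a few lines. This is a hedged illustration only: the relative tolerance `rel_tol` that defines "correctly priced", the equal-weight average, and all function names and numbers are assumptions, not the paper's definitions or results.

```python
from typing import Dict, List, Sequence


def pricing_breakdown(predicted: Sequence[float], market: Sequence[float],
                      rel_tol: float = 0.01) -> Dict[str, float]:
    """Percentage of options a model overprices, underprices, or prices correctly.

    A prediction counts as "correct" when it lies within rel_tol of the
    market price (assumed criterion; the source does not state one).
    """
    if len(predicted) != len(market) or len(market) == 0:
        raise ValueError("predicted and market must be non-empty and equal length")
    counts = {"over": 0, "under": 0, "correct": 0}
    for p, m in zip(predicted, market):
        if abs(p - m) <= rel_tol * abs(m):
            counts["correct"] += 1
        elif p > m:
            counts["over"] += 1
        else:
            counts["under"] += 1
    n = len(market)
    return {label: 100.0 * count / n for label, count in counts.items()}


def average_ensemble(preds_a: Sequence[float],
                     preds_b: Sequence[float]) -> List[float]:
    """Equal-weight average of two models' prices (one simple way to combine)."""
    return [0.5 * (a + b) for a, b in zip(preds_a, preds_b)]


# Toy numbers chosen so that the two models err in opposite directions.
market_prices = [10.0, 20.0, 30.0, 40.0]
model_a = [11.0, 22.0, 33.0, 40.0]   # tends to overprice
model_b = [9.0, 18.0, 27.0, 40.0]    # tends to underprice

breakdown_a = pricing_breakdown(model_a, market_prices)
breakdown_b = pricing_breakdown(model_b, market_prices)
breakdown_avg = pricing_breakdown(average_ensemble(model_a, model_b), market_prices)
```

In this toy case each model misprices 75% of the options in opposite directions, while the average prices all of them within tolerance. Real errors would rarely cancel this cleanly; in practice the same breakdown would be computed separately per ticker and per moneyness category.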
