Combining Autoregressive Integrated Moving Average Model and Gaussian Process Regression to Improve Stock Price Forecast

Shiying Tu, Jiehu Huang, Huailong Mu, Juan Lu, Ying Li
DOI: https://doi.org/10.3390/math12081187
IF: 2.4
2024-04-16
Mathematics
Abstract: Stock market performance is one key indicator of the economic condition of a country, and stock price forecasting is important for investments and financial risk management. However, the inherent nonlinearity and complexity in stock price movements imply that simple conventional modeling techniques are not adequate for stock price forecasting. In this paper, we present a hybrid model (ARIMA + GPRC) which combines the autoregressive integrated moving average (ARIMA) model and Gaussian process regression (GPR) with a combined covariance function (GPRC). The proposed hybrid model can account for both the linearity and nonlinearity in stock price movements. Based on daily data on three stocks listed on the Shanghai Stock Exchange (SSE), it is found that GPRC outperforms GPR with a single covariance function. Further, the proposed hybrid model is compared with the ARIMA model, artificial neural network (ANN), and GPRC model. Based on the forecasting trend and the statistical performance of the four models, the ARIMA + GPRC model is found to be the dominant model for stock price forecasting and can significantly improve forecasting performance.
What problem does this paper attempt to address?
The paper addresses the difficulty of capturing both linear and nonlinear features in stock price prediction. Traditional single models such as ARIMA handle linear relationships in time series data but fail to capture nonlinear structure. Nonlinear models such as neural networks handle nonlinear relationships but may overlook the linear features in the data. The paper therefore proposes a hybrid model (ARIMA + GPRC) that combines the Autoregressive Integrated Moving Average (ARIMA) model with Gaussian Process Regression (GPR) using a combined covariance function, so that both the linear and the nonlinear features of stock price movements are modeled. The paper proceeds in the following steps:
1. **Data Acquisition**: The daily closing prices of three companies listed on the Shanghai Stock Exchange (SSE) (BOE, GREE, ZTE) from January 4, 2007, to September 30, 2017, were selected as the sample.
2. **Modeling**: First, the ARIMA model is fitted to the sample data to extract its linear features and produce fitted values and forecasts. The residual sequence is then obtained by subtracting the ARIMA fit from the original data, and a GPR model is trained on this residual sequence to capture the nonlinear features.
3. **Prediction**: The prediction performance of the GPR model with a single kernel function (GPRS) and the GPR model with a combined kernel function (GPRC) is compared on the three stocks.
4. **Evaluation**: Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) are used to evaluate prediction performance. The proposed hybrid model (ARIMA + GPRC) is compared with the ARIMA model, the GPRC model, and an Artificial Neural Network (ANN) model.
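The modeling step (a linear fit first, then a GPR on the residuals) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several pieces are assumptions made for illustration: a least-squares AR(p) fit on the first-differenced series stands in for a full ARIMA fit, the GPR takes the time index as its input, the combined kernel is a sum of a squared-exponential and a periodic kernel (the paper's actual combination may differ), and all hyperparameters are fixed by hand rather than tuned by maximizing the marginal likelihood. The function names `fit_ar_on_diff`, `combined_kernel`, `gp_predict`, and `hybrid_one_step` are hypothetical.

```python
import numpy as np


def fit_ar_on_diff(y, p=2):
    """Least-squares AR(p) on the first-differenced series.

    Simplified stand-in for ARIMA(p, 1, 0). Returns the coefficients
    (intercept first) and the one-step fitted levels aligned with y[p+1:].
    """
    d = np.diff(y)  # d[t] = y[t+1] - y[t]
    lags = [d[p - k:len(d) - k] for k in range(1, p + 1)]  # d[t-1], ..., d[t-p]
    X = np.column_stack([np.ones(len(d) - p)] + lags)
    coef, *_ = np.linalg.lstsq(X, d[p:], rcond=None)
    fitted_levels = y[p:-1] + X @ coef  # y[t] plus the predicted difference
    return coef, fitted_levels


def combined_kernel(a, b, ell=10.0, sf=1.0, period=20.0, sp=0.5):
    """Sum of a squared-exponential and a periodic kernel (illustrative choice)."""
    diff = a[:, None] - b[None, :]
    se = sf ** 2 * np.exp(-0.5 * (diff / ell) ** 2)
    per = sp ** 2 * np.exp(-2.0 * np.sin(np.pi * diff / period) ** 2)
    return se + per


def gp_predict(t_train, r_train, t_test, noise=0.1):
    """Posterior mean of a zero-mean GP with fixed hyperparameters."""
    K = combined_kernel(t_train, t_train) + noise ** 2 * np.eye(len(t_train))
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, r_train))
    return combined_kernel(t_test, t_train) @ alpha


def hybrid_one_step(y, p=2):
    """One-step-ahead forecast: linear part plus GP forecast of the residual."""
    y = np.asarray(y, dtype=float)
    coef, fitted = fit_ar_on_diff(y, p)
    resid = y[p + 1:] - fitted  # what the linear model missed
    t_train = np.arange(p + 1, len(y), dtype=float)

    d = np.diff(y)
    x_next = np.concatenate(([1.0], d[-1:-p - 1:-1]))  # 1, d[-1], ..., d[-p]
    linear = y[-1] + x_next @ coef

    t_next = np.array([float(len(y))])
    nonlinear = gp_predict(t_train, resid, t_next)[0]
    return linear + nonlinear, linear, nonlinear


if __name__ == "__main__":
    # Synthetic random-walk "prices"; not data from the paper.
    rng = np.random.default_rng(0)
    prices = 10 + np.cumsum(rng.normal(0, 0.1, 200))
    total, lin, nonlin = hybrid_one_step(prices)
    print("hybrid:", total, "linear:", lin, "residual correction:", nonlin)
```

The final forecast is simply the sum of the two components, which matches the additive decomposition described in the modeling step. With a library ARIMA implementation, only `fit_ar_on_diff` would need to be swapped out.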
Through the above methods, the paper aims to provide a more effective stock price prediction model to help investors and financial institutions make better investment decisions.
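The four error metrics named in the evaluation step have standard definitions, sketched below with NumPy. The helper name `forecast_errors` is hypothetical, and reporting MAPE as a percentage is an assumption; the paper may scale it differently. MAPE also assumes that no actual value is zero, which holds for stock prices.

```python
import numpy as np


def forecast_errors(actual, predicted):
    """Return MSE, RMSE, MAE, and MAPE (in percent) for two equal-length series."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    err = actual - predicted
    mse = np.mean(err ** 2)
    return {
        "MSE": mse,
        "RMSE": np.sqrt(mse),
        "MAE": np.mean(np.abs(err)),
        "MAPE": 100.0 * np.mean(np.abs(err / actual)),  # requires actual != 0
    }


if __name__ == "__main__":
    # Toy numbers for illustration only; not results from the paper.
    print(forecast_errors([10.0, 20.0, 40.0], [11.0, 18.0, 40.0]))
```

Because RMSE is the square root of MSE, the two always rank models identically; MAE and MAPE can rank them differently when a few large errors dominate.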