Enhancing stock market prediction using three-phase classifier and EM-EPO optimization with news feeds and historical data
Shilpa Dixit,Nitasha Soni,Dixit, Shilpa
DOI: https://doi.org/10.1007/s11042-023-17184-x
IF: 2.577
2023-10-03
Multimedia Tools and Applications
Abstract:Stock price forecasting is a crucial area of research that demands a thorough comprehension of market dynamics and sophisticated analytical methods. Deep neural networks have recently demonstrated considerable potential for enabling academics to create extremely precise models for foretelling financial patterns. Another quickly developing technology is Natural language processing (NLP), which is increasingly used to evaluate financial data, notably news, and social media sentiment, and forecast the course of the market. An innovative deep learning-based stock market prediction model is created in this research paper. The historic stock market data and news feed as the information source. The acquired raw data (news data) are pre-processed using lowercase text conversion, punctuation removal, stop word removal, tokenization of sentences, normalization, and a bag of words technique. Then, from the pre-processed news data, the features such as Parts of Speech (PoS), Term Frequency-Inverse Document Frequency (TF-IDF), and N-gram-based features are extracted. In addition, the Moving Average Convergence/Divergence oscillator (MACD), and Relative Strength Index (RSI) based features are extracted from the historic stock market data. The extracted features from the historic stock market data and news feeds are fused. The optimal features are then chosen from the fused features using a new hybrid optimization model called Exchange Market Emperor Pigeon Optimization (EMEPO). The proposed EMEPO model is a combination of the standard Exchange Market Algorithm (EMA) and Emperor Pigeon Optimization (EPO). A new three-phase classifier is introduced in this research work, for accurate stock price forecasting. The proposed three-phase classifier includes Convolutional neural networks (CNN), Bidirectional long short-term memories (Bi-LSTM), and Autoencoder. The three-phase classifier is trained using the selected EMEPO-based features. The projected outcome from the three-phase classifier portrays the stock market prices. In all three feature selection events, the suggested method produces positive outcomes. In Case 1 (Open/Close), the Mean Absolute Error (MAE) value is 0.051582, suggesting a minimal average difference between the anticipated and actual values. The accuracy of the model in forecasting percentage errors is shown by the Mean Absolute Percentage Error (MAPE), which is 11.28154%. Similar results are obtained by the model in Case 2 (High/Low/Close), with an MAE of 0.056408 and a MAPE of 12.0081%. These results show how well the suggested strategy works for anticipating stock prices with accuracy.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering