Pork Price Prediction Using Topic Modeling and Feature Scoring Method

Tserenpurev Chuluunsaikhan,Kwan-Hee Yoo,HyungChul Rah,Aziz Nasridinov
DOI: https://doi.org/10.1007/978-981-33-6757-9_35
2021-01-01
Abstract:A large amount of text data may hide a numeric connection related to some other subject, for example, price. In this paper, we aimed to predict pork prices based on topic modeling and word scoring method. This study consists of four steps, such as feature extraction, word scoring, feature selection, and prediction. Any prediction model has input/features and output. We extracted our features from online news data using the topic modeling technique (LDA). Also, we selected the daily pork price as the output. After that, we created a word scoring corpus using the result of LDA and price movements. Because of our features and output are numeric values, we applied the Pearson's correlation as feature selection. To check our word scoring method, we built a prediction model of pork price using LSTM. We evaluated the model without feature selection and with feature selection. We used RMSE, MAE, and MAPE to measure our model accuracy. The results show that our model can be used in the price prediction of pork and other agricultural commodities.
What problem does this paper attempt to address?