Research on prediction of online purchasing behavior based on hybrid model

Xi Chen, Shi Ding, Yongsheng Xiang, Lin Liu
DOI: https://doi.org/10.1088/1742-6596/1827/1/012075
2021-03-01
Journal of Physics: Conference Series
Abstract: Studying users' potential purchase behavior helps merchants develop better marketing strategies. Many current methods for predicting online purchasing behavior rely on simple rules, and their results are unsatisfactory. We design a hybrid model that combines a Gradient Boosting Decision Tree with logistic regression and uses the association features between users and commodities to predict users' purchase behavior accurately. First, a clustering algorithm and association rules are used to address data imbalance and to mine additional potentially related features. This scheme not only improves the efficiency of processing large data sets but also addresses the user cold-start problem. Second, we train a scalable tree boosting system (XGBoost), a strong classifier composed of several weak classifiers, on the initial feature set. Feature reconstruction then combines the new features with the original features to form a new training set, and a logistic regression (LR) model is trained on this new training set to complete the hybrid machine learning system. Compared with existing schemes, the integrated decision tree model can train on more samples with fewer resources. The experimental results show that the hybrid model achieves higher accuracy and a higher F1 score than a single model.
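The feature-reconstruction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a tiny hand-rolled booster of depth-1 trees stands in for XGBoost, the "new features" are assumed to be one-hot encoded leaf indices (a common way to pair boosted trees with LR), and the synthetic data and all function names are hypothetical.

```python
import math
import random

def sigmoid(z):
    # Clamp the score so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, residuals):
    """Fit a depth-1 regression tree to the residuals by least squares."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, t, lm, rm)
    return best[1:]  # (feature index, threshold, left value, right value)

def fit_boosted_stumps(X, y, n_trees=5, lr=0.5):
    """Gradient boosting on log loss; each weak learner is one stump."""
    scores, stumps = [0.0] * len(X), []
    for _ in range(n_trees):
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, t, lm, rm = fit_stump(X, residuals)
        stumps.append((j, t))
        scores = [s + lr * (lm if row[j] <= t else rm) for row, s in zip(X, scores)]
    return stumps

def leaf_features(row, stumps):
    """One-hot encode which leaf of each stump the sample falls into."""
    feats = []
    for j, t in stumps:
        feats += [1.0, 0.0] if row[j] <= t else [0.0, 1.0]
    return feats

def fit_logistic(X, y, epochs=200, lr=0.1):
    """Plain stochastic gradient descent for logistic regression."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for row, yi in zip(X, y):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, row)) + b) - yi
            w = [wi - lr * err * xi for wi, xi in zip(w, row)]
            b -= lr * err
    return w, b

# Synthetic stand-in data: label is 1 when the first feature is mid-range.
rng = random.Random(0)
X = [[rng.random(), rng.random()] for _ in range(120)]
y = [1 if 0.3 < row[0] < 0.7 else 0 for row in X]

stumps = fit_boosted_stumps(X, y)
# Feature reconstruction: original features plus tree-derived features.
hybrid = [row + leaf_features(row, stumps) for row in X]
w, b = fit_logistic(hybrid, y)

preds = [1 if sigmoid(sum(wi * xi for wi, xi in zip(w, row)) + b) >= 0.5 else 0
         for row in hybrid]
acc = sum(p == yi for p, yi in zip(preds, y)) / len(y)
print(f"training accuracy of the hybrid sketch: {acc:.2f}")
```

In practice the booster would be replaced by a real library model and the evaluation would use held-out data; the sketch only shows how tree outputs and original features can be concatenated before the LR stage.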
Computer Science