Research on Statistics-Based Model for E-Commerce User Purchase Prediction

Huailin Dong,Lingwei Xie,Zhongnan Zhang
DOI: https://doi.org/10.1109/iccse.2015.7250308
2015-01-01
Abstract:This paper describes our work for ALIDATA DISCOVERY competition. Through analyzing massive realworld user action data provided by Tmall, one of the largest B2C online retail platforms in China, we try to predict future user purchases. The prediction results are judged by F(1)Score that is consist of two parts, precision and recall rate. The provided data set contains more than 500 million action records from over 12 million distinct users. Such a massive data set drives us to finish the task in MapReduce fashion on the Open Data Processing Service (ODPS) platform. According to statistical results, we classify all users into different groups firstly. Then the rule model, timing model, statistics model are adopted for predicting future user purchases. By comparison, the statistics model obtains the best F(1)Score.
What problem does this paper attempt to address?