Research on Logistics Path Frequent Patterns Based on Parallel Apriori

Jingjing CAO,Xinxin REN,Xianhao XU
DOI: https://doi.org/10.3778/j.issn.1002-8331.1803-0236
2019-01-01
Abstract:The traditional method of frequent path mining analysis is realized by the association rule algorithm. However, when dealing with large data sets, the traditional association rules algorithm will take up too much memory and process data slowly. In this paper, a parallel Apriori algorithm based on Fuzzy c-means clustering algorithm is proposed. The model performs clustering analysis of the original data set by Fuzzy c-means algorithm, divides the logistics path data which is considered as the same district into a data cluster with high similarity. Then the model utilizes the Apriori algorithm to mine the frequent paths in this district, so as to obtain the frequent logistics path of each area. Meanwhile, the algorithm is parallelized through the Hadoop platform, which can effectively improve the efficiency and the quality of the algorithm. Through the analysis of the frequent path of logistics, managers can better understand the flow of goods and make the de-cision of the optimization of the delivery path.
What problem does this paper attempt to address?