Efficient Parallel Algorithm For Mining Association Rules Based On Master-Slave Model

Bo He,Yuan Chen,Huaqiu Wang,Yue Wang,Quanli Liu
2006-01-01
Abstract:Most parallel mining algorithms worked on net structure network and didn't set centre node. Furthermore, most of them adopted Apriori-like algorithm framework. Whereas there were some problems in these algorithms more or less: data deviation, a lot of candidate itemsets, heavy communication traffic, more synchronization and frequent database scanning. Aiming at these problems, this paper proposed an efficient parallel algorithm for mining association rules based on masterslave model, namely, EPMAR algorithm, which set centre node. It distributed data according horizontal projection method. EPMAR algorithm made computer nodes compute local frequent itemsets independently with FP-growth algorithm, then the centre node exchanged data with other computer nodes and combined, finally, global frequent itemsets were gained. EPMAR algorithm required far less communication traffic by the searching strategies of topdown and bottom-up. It has a good speedup and scalability. Theoretical analysis and experimental results suggest that EPMAR algorithm is fast and effective.
What problem does this paper attempt to address?