A trajectory data density partition based distributed parallel clustering method

Jiayu WANG, Zhenyu ZHANG, Zheng CHU, Xiaohong WU
DOI: https://doi.org/10.3969/j.issn.0253-2778.2018.01.007
2018-01-01
Journal of University of Science and Technology of China
Abstract: The development of global positioning technology and location-based services has contributed to the growth of trajectory big data. Trajectory clustering is one of the most important trajectory analysis tasks and has been extensively studied. Currently, most clustering methods operate in a single-processor mode, and processing large-scale trajectory data takes a long time, making it difficult to meet the strict timeliness requirements of trajectory analysis tasks. To solve this problem, a distributed parallel clustering method based on trajectory density partitioning is proposed. First, the whole dataset is abstracted as a rectangular region, and the dataset is divided into several partitions with nearly equal workloads through a transformation of the longest dimension of the rectangle, thus constructing the local datasets for distributed parallel clustering. Then, each worker server runs the DBSCAN clustering algorithm on its local partition, and the manager server merges and integrates the local clustering results. The experimental results show that the algorithm is effective and improves the computational speed of clustering analysis to a certain degree.
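The partition-then-cluster pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: trajectories are reduced to plain 2D points, the "transformation of the longest dimension" is interpreted as a recursive cut along the longest side of the bounding rectangle that balances point counts, and DBSCAN is a plain quadratic-time version. The function names `split_balanced` and `dbscan` are hypothetical, and the manager-side merge of clusters that straddle partition borders is left out.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def split_balanced(points: List[Point], n_parts: int) -> List[List[Point]]:
    """Recursively cut along the longest side of the bounding rectangle.

    Assumption: balancing point counts stands in for balancing workload.
    """
    if n_parts <= 1 or len(points) <= 1:
        return [points]
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    # Pick the axis on which the bounding rectangle is longest.
    axis = 0 if (max(xs) - min(xs)) >= (max(ys) - min(ys)) else 1
    ordered = sorted(points, key=lambda p: p[axis])
    left_parts = n_parts // 2
    # Give each side a share of points proportional to its share of partitions.
    cut = len(ordered) * left_parts // n_parts
    return (split_balanced(ordered[:cut], left_parts)
            + split_balanced(ordered[cut:], n_parts - left_parts))


def dbscan(points: List[Point], eps: float, min_pts: int) -> List[int]:
    """Plain O(n^2) DBSCAN; returns one label per point, -1 for noise."""
    labels: List = [None] * len(points)

    def neighbors(i: int) -> List[int]:
        xi, yi = points[i]
        return [j for j, (x, y) in enumerate(points)
                if (x - xi) ** 2 + (y - yi) ** 2 <= eps ** 2]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisionally noise; may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reclaimed as a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nb = neighbors(j)
            if len(nb) >= min_pts:  # j is a core point: expand through it
                queue.extend(nb)
    return labels


# Two well-separated groups of points; each "worker" clusters one partition.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]
parts = split_balanced(points, 2)
local_labels = [dbscan(part, eps=1.5, min_pts=3) for part in parts]
```

Here the workers are simulated by a simple loop over `parts`; in a real deployment each partition would be sent to a separate process or server, and the manager would still need a merge step to join clusters split by a partition boundary.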