A Graph Based Approach for Mining Significant Places in Trajectory Data

Shimin Wang,Xinzheng Niu,Philippe Fournier-Viger,Dongmei Zhou,Fan Min
DOI: https://doi.org/10.1016/j.ins.2022.07.046
IF: 8.1
2022-01-01
Information Sciences
Abstract:Significant place mining in spatiotemporal trajectory data is a key task for mobile pattern mining, useful for supporting location-aware services. State-of-the-art trajectory clustering algorithms utilize a density-based distance measure. However, some major problems with this approach are that (1) results are often inaccurate, especially on data of varying density, (2) the user must fine-tune many thresholds that are unintuitive to set, and (3) boundary points between clusters are often assigned to the wrong locations. Performance is also a major issue as many state-of-the-art algorithms have a very high time complexity. Motivated by these issues, this paper proposes an approach inspired by the data field theory and community detection. It is a graph-based significant place mining algorithm, called GB-SPM, for capturing and characterizing the essence of similarity between nodes. GB-SPM first applies a novel low index neighborhood velocity point filtration method to extract characteristic points. Then, a characteristic point index neighborhood is used to map them to graph nodes. In this way, the original problem is transformed into a community detection problem in complex community networks. Finally, a new edge weight metric is proposed to capture and characterize the nature of similarity between nodes. To evaluate clustering quality, we used the Silhouette (SI) for unannotated data to value inter-cluster separation and intra-cluster homogeneity. To evaluate mining effectiveness, we used Matthew's correlation coefficient (MCC) for annotated data. Numerous experiments were carried out on real world datasets, and the accuracy and performance of the designed algorithm was compared with the state-of-the-art algorithms. Results show that GB-SPM improves on average SI by 13.9%, MCC by 20.7%, and runtime by 5.15 times. (C) 2022 Elsevier Inc. All rights reserved.
What problem does this paper attempt to address?