Tinba: Incremental partitioning for efficient trajectory analytics

Ruijie Tian,Weishi Zhang,Fei Wang,Kemal Polat,Fayadh Alenezi
DOI: https://doi.org/10.1016/j.aei.2023.102064
IF: 8.8
2023-06-29
Advanced Engineering Informatics
Abstract:Applications with mobile and sensing devices have already become ubiquitous. In most of these applications, trajectory data is continuously growing to huge volumes. Existing systems for big trajectory data organize trajectories at distributed block storage systems. Systems like DITA that use block storage (e.g., 128 MB each) are more efficient for analytical queries, but they cannot incrementally maintain the partitioned data and do not support delete operations, resulting in difficulties in trajectory analytics. In this paper, we propose an incremental trajectory partitioning framework Tinba that enables distributed block storage systems to efficiently maintain optimized partitions under incremental updates of trajectories. We employ a data flushing technique to bulk ingest trajectory data for random writing in distributed file system (DFS). We recast the incremental partitioning problem as an optimal partitioning problem and prove its NP-hardness. A cost–benefit model is proposed to address the optimal partitioning problem. Moreover, Tinba supports most of the existing similarity measures to quantify the similarity between trajectories. A heuristic is developed to instantiate the Tinba framework. Comprehensive experiments on real-world and synthetic datasets demonstrate the advancements in ingestion performance and partition quality, as opposed to other trajectory partition methods.
engineering, multidisciplinary,computer science, artificial intelligence
What problem does this paper attempt to address?