Distributed Skyline Trajectory Query Processing

Amina Belhassena, Hongzhi Wang
DOI: https://doi.org/10.1145/3063955.3063974
2017-01-01
Abstract: Massive amounts of trajectory data collected from GPS devices have emerged in recent years. Many researchers have proposed trajectory queries, such as the top-k query, and have solved them based on distance and text relevance. However, the appropriate weights for these criteria are unknown. We therefore address the trajectory skyline query based on distance and activity keywords. Furthermore, given the huge number of keyword-annotated semantic trajectories, a user may enter a wrong activity keyword when searching for a trajectory, which makes it hard to retrieve trajectories by exact keyword matching. In this paper, we handle this fuzzy trajectory matching problem using edit distance and activity weight. To accelerate query processing, we first use a Distributed Mining Trajectory based on R-tree (DMTR-Tree) index to organize the big trajectory data, and then develop an efficient algorithm to process the trajectory skyline query. For fast computation, we run the algorithm on the Apache Spark cluster computing framework with MapReduce as the programming model. Theoretical analysis and experimental results show that the query-processing algorithm is efficient and scalable.
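To make the idea of a skyline over distance and fuzzy keyword match concrete, the following is a minimal single-machine sketch, not the paper's algorithm. It assumes each candidate trajectory has already been reduced to two scores to be minimized: a spatial distance to the query and the edit distance between the query keyword and the trajectory's activity keyword. The names edit_distance, dominates, skyline, and the sample data are hypothetical, and the DMTR-Tree index, activity weights, and Spark distribution described in the abstract are deliberately left out.

```python
from typing import List, Sequence, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def dominates(p: Sequence[float], q: Sequence[float]) -> bool:
    """p dominates q if p is no worse in every score and strictly better in one."""
    return (all(x <= y for x, y in zip(p, q))
            and any(x < y for x, y in zip(p, q)))


def skyline(cands: List[Tuple[str, Tuple[float, float]]]) -> List[str]:
    """Naive quadratic skyline: keep candidates that no other candidate dominates."""
    return [cid for cid, score in cands
            if not any(dominates(other, score) for _, other in cands)]


# Hypothetical data: (trajectory id, spatial distance to query, activity keyword).
query_keyword = "hikng"  # misspelled on purpose to exercise fuzzy matching
trajectories = [
    ("t1", 1.0, "hiking"),
    ("t2", 2.0, "hiking"),
    ("t3", 0.5, "biking"),
    ("t4", 3.0, "swimming"),
]
candidates = [(tid, (dist, float(edit_distance(query_keyword, kw))))
              for tid, dist, kw in trajectories]
result = skyline(candidates)
print(result)
```

Here t1 and t3 are incomparable (t3 is closer, t1 matches the keyword better), so both remain in the skyline without any weighting between the two criteria, while t2 and t4 are dominated by t1.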