Abstract:The Multiple Time Bucket Join (MTB-join) algorithm is the state of the art for processing the continuous intersection join (CI-join) query over moving objects. It considerably outperforms alternatives, but still falls short of real-time application performance requirements for large sets of moving objects. In this paper, we achieve real-time performance for the CI-join query over large sets of moving objects by exploiting the computational power of commodity graphics processing units (GPUs). We first analyze how the main characteristics of the MTB-join algorithm make it ill suited to GPUs and identify key challenges in designing efficient GPU-based algorithms for the query. We then address these challenges by developing the multi-layered grid join (MLG-join) algorithm which has the following key features: (i) memory locality friendly indexing, (ii) no dynamic memory allocation, (iii) in-place object updates, (iv) lock-free concurrent updates, and (v) massive parallelism. These features unleash the full potential of the memory bandwidth and parallel processing of GPUs. Furthermore, we conduct a theoretical analysis which can predict the pruning power of the MLG-join algorithm given certain parameter values used in the algorithm. This allows us to select optimal parameter values. Through extensive experimental results, we show that our analysis accurately models the MLG-join algorithm's sensitivity to parameter values. The proposed MLG-join algorithm outperforms the MTB-join algorithm, and a GPU-based nested-loops join algorithm, by up to two orders of magnitude, and achieves real-time performance for CI-join queries on large sets of moving objects.

Modular Pipeline Architecture for Accelerating Join Operation in RDBMS

High-Parallelism and Pipelined Architecture for Accelerating Sort-Merge Join on FPGA

Performance Evaluation for Distributed Join Based on MapReduce.

Optimization Factor Analysis Of Large-Scale Join Queries On Different Platforms

Join Query Optimization Based on MapReduce under Skewed Data

Resource-Efficient Parallel Tree-Based Join Architecture on FPGA

Modular Serial Pipelined Sorting Architecture For Continuous Variable-Length Sequences With A Very Simple Control Strategy

Scalable Parallel Join for Huge Tables

Robust Join Processing with Diamond Hardened Joins

Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment

Runtime-optimized Multi-way Stream Join Operator for Large-scale Streaming data

The Sort-Merge-Shrink join

RPK-table Based Efficient Algorithm for Join-Aggregate Query on MapReduce.

A Filter-Based Multi-Join Algorithm in Cloud Computing Environment

Distributed GPU Joins on Fast RDMA-capable Networks

A novel agent-based parallel ETL system for massive data

Real-time Continuous Intersection Joins over Large Sets of Moving Objects Using Graphic Processing Units

Efficiently Processing Large Relational Joins on GPUs

Adaptive Multi-join Query Processing in PDBMS

Utilizing the column imprints to accelerate no‐partitioning hash joins in large‐scale edge systems

Efficient Join Synopsis Maintenance for Data Warehouse.