Abstract:Modern CPUs keep integrating more cores and large size cache, which is beneficial for in-memory databases to improve parallel processing power and cache locality. While state-of-the-art CPUs have diverse architectures and roadmaps such as large core count and large cache size (AMD x86), moderate core count and cache size (intel x86), large core count and moderate cache size (ARM), exploring in-memory databases performance characteristics for different CPU architectures is important for in-memory database designs and optimizations. In this article, we develop a fine-grained in-memory database benchmark to evaluate the performance of each operator on different CPUs to explore how CPU hardware architectures influence performance. Different from well known conclusions that more cores and larger cache size can achieve higher performance, we find out that the micro cache architectures play an important role opposite to core count and cache size, the shared monolithic L3 cache with moderate size beats large disaggregated L3 cache. The experiments also show that predicting operator performance on different CPUs is difficult according to diverse CPU architectures and micro cache architectures, and different implementations of each operator are not always high or low with interleaved strong and weak performance regions influenced by CPU hardware architectures. Intel x86 CPUs represent cache-centric processor design, while AMD x86 and ARM CPUs represent computing-centric processor design, the OLAP benchmark experiments of SSB discover that OmniSciDB and OLAP Accelerator with vector-wise processing model performs well on intel x86 CPUs compared to AMD x86 CPUs and the JIT compliant based Hyper prefers to AMD x86 CPUs rather than intel x86 CPUs. The CPU roadmaps of increasing cores or improving cache locality should be considered for in-memory database algorithm design and platform selection.

The Case for Learned In-Memory Joins

Evaluating Learned Indexes for External-Memory Joins

Modeling and Benchmarking Computing-in-Memory for Design Space Exploration.

Parallel In-Memory Evaluation of Spatial Joins

Utilizing the column imprints to accelerate no‐partitioning hash joins in large‐scale edge systems

Making In-Memory Learned Indexes Efficient on Disk

A learning-based framework for spatial join processing: estimation, optimization and tuning

Forecasting the cost of processing multi-join queries via hashing for main-memory databases (Extended version)

Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last

Model Joins: Enabling Analytics Over Joins of Absent Big Tables

Learned Index for Non-Key Queries

Enhancing In-Memory Spatial Indexing with Learned Search

WWW: What, When, Where to Compute-in-Memory

NOCAP: Near-Optimal Correlation-Aware Partitioning Joins

Robust Join Processing with Diamond Hardened Joins

Exploring Fine-Grained In-Memory Database Performance for Modern CPUs

In-Memory Computing with Associative Memories: A Cross-Layer Perspective

A Simple Yet High-Performing On-disk Learned Index: Can We Have Our Cake and Eat it Too?

Design Trade-offs for a Robust Dynamic Hybrid Hash Join (Extended Version)

Learning to Optimize Join Queries With Deep Reinforcement Learning

LIFM: A Persistent Learned Index for Flash Memory