Highlights:
We propose a flash-optimized, unbalanced R-tree index for flash memory.
We introduce overflow nodes to defer node-splitting operations on the index.
We present a new buffering scheme to cache node updates to the index.
We conduct experiments on both real solid-state drives and a flash simulation framework.

Abstract: The R-tree is widely used in spatial data management and data analysis to speed up spatial data retrieval. However, the original R-tree was designed for magnetic disks and performs poorly on flash memory because of flash-specific characteristics such as asymmetric read/write speeds (fast reads, slow writes) and the erase-before-write constraint. In particular, the original R-tree update mechanism usually has to update several interior nodes whenever an index entry is inserted into or deleted from a leaf node, which produces many slow writes to flash memory. With flash memory now widely used in location-based applications, e.g., to store moving-object trajectories in intelligent transportation systems, optimizing the R-tree for flash memory has become a critical issue. In this paper, we propose a novel spatial index, the Flash-Optimized R-tree, that is designed for flash memory. Specifically, we defer node-splitting operations on the R-tree by introducing overflow nodes, which results in an unbalanced tree structure. This mechanism reduces random writes to flash memory and improves the overall performance of the R-tree. In addition, we present a new buffering scheme that efficiently caches updates to the tree and further reduces random writes to flash memory. We conduct extensive experiments on real flash-memory storage devices as well as on a flash memory simulation platform, and the results indicate that our proposal is efficient with respect to the metrics evaluated.
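The deferred-splitting idea described in the abstract can be illustrated with a toy write-count model. The sketch below is not the paper's algorithm: the names `ToyTree`, `CAPACITY`, and `max_overflow`, the sort-by-x split, the single level of leaves, and the cost model (one page write per page touched, with the leaf-to-overflow link assumed to live in memory) are all assumptions made for illustration. The buffering scheme is not modeled.

```python
import random
from dataclasses import dataclass, field

CAPACITY = 4  # entries per page; a hypothetical, deliberately tiny value


@dataclass
class Leaf:
    entries: list = field(default_factory=list)   # primary page
    overflow: list = field(default_factory=list)  # chained overflow pages

    def points(self):
        return self.entries + [p for page in self.overflow for p in page]


def bounding_area(points):
    """Area of the minimum bounding rectangle of a list of (x, y) points."""
    if not points:
        return 0.0
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


class ToyTree:
    """One level of leaves under an implicit root; counts simulated page writes."""

    def __init__(self, max_overflow):
        self.max_overflow = max_overflow  # 0 means: split as soon as a leaf is full
        self.leaves = [Leaf()]
        self.page_writes = 0

    def _choose_leaf(self, p):
        # Classic R-tree heuristic: least enlargement of the bounding rectangle.
        def growth(leaf):
            pts = leaf.points()
            return bounding_area(pts + [p]) - bounding_area(pts)
        return min(self.leaves, key=growth)

    def insert(self, p):
        leaf = self._choose_leaf(p)
        if len(leaf.entries) < CAPACITY:
            leaf.entries.append(p)
            self.page_writes += 1  # rewrite the leaf page
            return
        # Leaf is full: reuse the last overflow page if it has room.
        if leaf.overflow and len(leaf.overflow[-1]) < CAPACITY:
            leaf.overflow[-1].append(p)
            self.page_writes += 1  # rewrite only that overflow page
            return
        # Otherwise open a new overflow page, if the policy still allows one.
        # Assumption: the leaf-to-overflow link is tracked in memory.
        if len(leaf.overflow) < self.max_overflow:
            leaf.overflow.append([p])
            self.page_writes += 1  # write the new overflow page
            return
        self._split(leaf, p)

    def _split(self, leaf, p):
        # Simplistic split (an assumption): sort by x and cut into full pages.
        pts = sorted(leaf.points() + [p])
        new_leaves = [Leaf(pts[i:i + CAPACITY]) for i in range(0, len(pts), CAPACITY)]
        self.leaves = [l for l in self.leaves if l is not leaf] + new_leaves
        self.page_writes += len(new_leaves) + 1  # new leaves plus the parent node


def run(max_overflow, points):
    tree = ToyTree(max_overflow)
    for p in points:
        tree.insert(p)
    return tree


rng = random.Random(0)
points = [(rng.random(), rng.random()) for _ in range(200)]
eager = run(0, points)      # split immediately, as in the original R-tree
deferred = run(2, points)   # allow two overflow pages per leaf before splitting
print("eager writes:", eager.page_writes, "deferred writes:", deferred.page_writes)
```

Under this cost model, each eager split costs three page writes for one extra leaf, while a deferred split is paid only after the overflow pages have absorbed eight further inserts at one write each. The trade-off, which the sketch does not measure, is that a search must also read a leaf's overflow pages.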

FlashR: R-Programmed Parallel and Scalable Machine Learning using SSDs

FlashRNN: Optimizing Traditional RNNs on Modern Hardware

FLASH 1.0: A Software Framework for Rapid Parallel Deployment and Enhancing Host Code Portability in Heterogeneous Computing

Reo: Enhancing Reliability and Efficiency of Object-based Flash Caching

Flash: A Framework for Programming Distributed Graph Processing Algorithms.

Towards Efficient Concurrent Scans on Flash Disks.

Design and implementation of reconfigurable acceleration for in-memory distributed big data computing.

Cognitive SSD: A Deep Learning Engine for In-Storage Data Retrieval

FlashDecoding++: Faster Large Language Model Inference on GPUs

Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention

Optimizing R-tree for flash memory.

Flash-DBSim: A Simulation Tool for Evaluating Flash-based Database Algorithms

Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference

Flash-based Computing In-Memory Scheme for IOT.

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering

RecSSD: near data processing for solid state drive based recommendation inference

FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor Contraction

IMPACT: In-Memory ComPuting Architecture Based on Y-FlAsh Technology for Coalesced Tsetlin Machine Inference

Parallelization of Classification Algorithms Based on SparkR