Abstract:Non- Volatile Memory (NVM) offers low-latency, non-volatility, and byte-addressability, positioning it as a highly promising device for database performance enhancement. Cur-rent research primarily focuses on utilizing LSM-Tree in con-junction with NVM to reduce write amplification and alleviate write stall issues. However, the comprehensive potential of NVM in simultaneously augmenting both read and write performances remains underexplored. And the previous NVM-enhanced LSM-Tree also ignores the sensitivity of NVM to small-grained random reads and writes, which we believe is the key to further improving read and write performance. To address these issues, we propose BushStore, an innovative LSM-Tree variant specifically optimized for NVM. BushStore is designed with a three-level architecture, where the higher levels of BushStore contain a group of immutable, non-clustered B+Trees, replacing traditional SSTables. By storing the non-leaf nodes in the DRAM and the leaf nodes in NVM, and separating the data pages from the indexes, these B+Trees are able to exhibit high performance for diverse read and write operations. Our approach encompasses four key techniques to significantly boost system efficiency: First, we develop novel data structures that localize read/write operations to confined NVM areas, enhancing access speed. Second, we optimize the key-value data handling during flushing and compaction phases, leveraging the superior scanning and sequantial writing capabilities of B+Trees to ex-pedite write and compaction processes. Third, we dynamically adjust the B+Tree sizes, enabling a balanced and optimized flushing and compaction process, thereby improving overall write performance. Fourth, we implement a lazy-delete Cuckoo filtering and lazy-persistent allocation strategy to accelerate query and compaction processes. Evaluations show that BushStore exhibits high performance and scalability under synthetic and real work-loads, and achieves an average performance improvement of 3.3x in random write throughput and 4.3x in random read throughput compared to the state-of-the-art MioDB system.

Making In-Memory Learned Indexes Efficient on Disk

A Simple Yet High-Performing On-disk Learned Index: Can We Have Our Cake and Eat it Too?

Updatable Learned Indexes Meet Disk-Resident DBMS -- From Evaluations to Design Choices

Revisiting Learned Index with Byte-addressable Persistent Storage

A Comprehensive Performance Evaluation of Modern In-Memory Indices

BushStore: Efficient B+Tree Group Indexing for LSM-Tree in Non-Volatile Memory

Enhancing In-Memory Spatial Indexing with Learned Search

Learned Index: A Comprehensive Experimental Evaluation

A Scalable Learned Index Scheme in Storage Systems

Learned Index for Spatial Queries

Evaluating Learned Indexes for External-Memory Joins

Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes

COLIN: A Cache-Conscious Dynamic Learned Index with High Read/Write Performance

LISA: A Learned Index Structure for Spatial Data

LearnedKV: Integrating LSM and Learned Index for Superior Performance on SSD

A Pluggable Learned Index Method via Sampling and Gap Insertion

Micro-architectural Analysis of a Learned Index

Why Are Learned Indexes So Effective but Sometimes Ineffective?

SLBRIN: A Spatial Learned Index Based on BRIN

WIPE: a Write-Optimized Learned Index for Persistent Memory

Main Memory Indexing: the Case for Bd-Tree