Abstract:Non- Volatile Memory (NVM) offers low-latency, non-volatility, and byte-addressability, positioning it as a highly promising device for database performance enhancement. Cur-rent research primarily focuses on utilizing LSM-Tree in con-junction with NVM to reduce write amplification and alleviate write stall issues. However, the comprehensive potential of NVM in simultaneously augmenting both read and write performances remains underexplored. And the previous NVM-enhanced LSM-Tree also ignores the sensitivity of NVM to small-grained random reads and writes, which we believe is the key to further improving read and write performance. To address these issues, we propose BushStore, an innovative LSM-Tree variant specifically optimized for NVM. BushStore is designed with a three-level architecture, where the higher levels of BushStore contain a group of immutable, non-clustered B+Trees, replacing traditional SSTables. By storing the non-leaf nodes in the DRAM and the leaf nodes in NVM, and separating the data pages from the indexes, these B+Trees are able to exhibit high performance for diverse read and write operations. Our approach encompasses four key techniques to significantly boost system efficiency: First, we develop novel data structures that localize read/write operations to confined NVM areas, enhancing access speed. Second, we optimize the key-value data handling during flushing and compaction phases, leveraging the superior scanning and sequantial writing capabilities of B+Trees to ex-pedite write and compaction processes. Third, we dynamically adjust the B+Tree sizes, enabling a balanced and optimized flushing and compaction process, thereby improving overall write performance. Fourth, we implement a lazy-delete Cuckoo filtering and lazy-persistent allocation strategy to accelerate query and compaction processes. Evaluations show that BushStore exhibits high performance and scalability under synthetic and real work-loads, and achieves an average performance improvement of 3.3x in random write throughput and 4.3x in random read throughput compared to the state-of-the-art MioDB system.

Revisiting Learned Index with Byte-addressable Persistent Storage

BushStore: Efficient B+Tree Group Indexing for LSM-Tree in Non-Volatile Memory

WIPE: a Write-Optimized Learned Index for Persistent Memory

Exploiting Persistent CPU Cache for Scalable Persistent Hash Index

The past, present and future of indexing on persistent memory

Revisiting Secondary Indexing in LSM-based Storage Systems with Persistent Memory.

A Simple Yet High-Performing On-disk Learned Index: Can We Have Our Cake and Eat it Too?

Making In-Memory Learned Indexes Efficient on Disk

Learned Index: A Comprehensive Experimental Evaluation

SLBRIN: A Spatial Learned Index Based on BRIN

A Scalable Learned Index Scheme in Storage Systems

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

A Comprehensive Performance Evaluation of Modern In-Memory Indices

Perseid: A Secondary Indexing Mechanism for LSM-Based Storage Systems

Optimizing LSM-based indexes for disaggregated memory

COLIN: A Cache-Conscious Dynamic Learned Index with High Read/Write Performance

HIndex-FLSM: Fragmented Log-Structured Merge Trees Integrated with Heat and Index

Revisiting PM-Based B +-Tree With Persistent CPU Cache

CCL-BTree: A Crash-Consistent Locality-Aware B+-Tree for Reducing XPBuffer-Induced Write Amplification in Persistent Memory.

PhaST: Hierarchical Concurrent Log-Free Skip List for Persistent Memory

<inline-formula> <tex-math notation="LaTeX">$\mathsf{B}^{p}$ </tex-math></inline-formula>-<inline-formula> <tex-math notation="LaTeX">$\mathsf{Tree}$ </tex-math></inline-formula>: A Predictive <inline-formula> <tex-math notation="LaTeX">$\mathsf{B}^{+}$ </tex-math></inline-formula>-<inline-formula>