Abstract:Non- Volatile Memory (NVM) offers low-latency, non-volatility, and byte-addressability, positioning it as a highly promising device for database performance enhancement. Cur-rent research primarily focuses on utilizing LSM-Tree in con-junction with NVM to reduce write amplification and alleviate write stall issues. However, the comprehensive potential of NVM in simultaneously augmenting both read and write performances remains underexplored. And the previous NVM-enhanced LSM-Tree also ignores the sensitivity of NVM to small-grained random reads and writes, which we believe is the key to further improving read and write performance. To address these issues, we propose BushStore, an innovative LSM-Tree variant specifically optimized for NVM. BushStore is designed with a three-level architecture, where the higher levels of BushStore contain a group of immutable, non-clustered B+Trees, replacing traditional SSTables. By storing the non-leaf nodes in the DRAM and the leaf nodes in NVM, and separating the data pages from the indexes, these B+Trees are able to exhibit high performance for diverse read and write operations. Our approach encompasses four key techniques to significantly boost system efficiency: First, we develop novel data structures that localize read/write operations to confined NVM areas, enhancing access speed. Second, we optimize the key-value data handling during flushing and compaction phases, leveraging the superior scanning and sequantial writing capabilities of B+Trees to ex-pedite write and compaction processes. Third, we dynamically adjust the B+Tree sizes, enabling a balanced and optimized flushing and compaction process, thereby improving overall write performance. Fourth, we implement a lazy-delete Cuckoo filtering and lazy-persistent allocation strategy to accelerate query and compaction processes. Evaluations show that BushStore exhibits high performance and scalability under synthetic and real work-loads, and achieves an average performance improvement of 3.3x in random write throughput and 4.3x in random read throughput compared to the state-of-the-art MioDB system.

Enabling Efficient NVM-Based Text Analytics Without Decompression

The Impact of Non-linear NVM Devices on In-Memory Computing

Reinforcement Learning Based Data Compression for Energy-Efficient Non-volatile Caches.

TADOC: Text Analytics Directly on Compression

BushStore: Efficient B+Tree Group Indexing for LSM-Tree in Non-Volatile Memory

G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression

Exploring Data Analytics Without Decompression on Embedded GPU Systems

System Evaluation of the Intel Optane Byte-addressable NVM

Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems

An Efficient Framework for Implementing Persist Data Structures on Remote NVM.

F-TADOC: FPGA-Based Text Analytics Directly on Compression with HLS

RC-NVM: Enabling Symmetric Row and Column Memory Accesses for In-memory Databases

Containing Analog Data Deluge at Edge through Frequency-Domain Compression in Collaborative Compute-in-Memory Networks

NOBtree: A NUMA-Optimized Tree Index for Nonvolatile Memory

Understanding and Analysis of B+ Trees on NVM Towards Consistency and Efficiency

TMO: transparent memory offloading in datacenters

Improving Performance of Flash Based Key-Value Stores Using Storage Class Memory As a Volatile Memory Extension

DyTAN: Dynamic Ternary Content Addressable Memory Using Nanoelectromechanical Relays.

Data Compression for Analytics over Large-scale In-memory Column Databases

Treatment of eosinophilic pustular folliculitis (Ofuji's disease) with tacrolimus ointment

Extending Memory Capacity in Consumer Devices with Emerging Non-Volatile Memory: An Experimental Study