Optimizing LSM-based indexes for disaggregated memory

Ruihong Wang, Chuqing Gao, Jianguo Wang, Prishita Kadam, M. Tamer Özsu, Walid G. Aref
DOI: https://doi.org/10.1007/s00778-024-00863-y
2024-06-21
The VLDB Journal
Abstract: The emerging trend of memory disaggregation, where CPU and memory are physically separated from each other and are connected via ultra-fast networking, e.g., over Remote Direct Memory Access (RDMA), allows elastic and independent scaling of compute (CPU) and main memory. This paper investigates how indexing can be efficiently designed in the memory-disaggregated architecture. Although existing research has optimized the B-tree for this new architecture, its performance is unsatisfactory. This paper focuses on LSM-based indexing and proposes dLSM, the first highly optimized LSM-tree for disaggregated memory. dLSM introduces a suite of optimizations including reducing software overhead, leveraging near-data computing, tuning for byte-addressability, and an instantiation over RDMA as a case study with RDMA-specific customizations to improve system performance. Experiments illustrate that dLSM achieves 2.3× to 11.6× higher write throughput than running the optimized B-tree and four adaptations of existing LSM-tree indexes over disaggregated memory. dLSM is written in C++ (with approximately 54,400 LOC), and is open-sourced.
computer science, information systems, hardware & architecture
What problem does this paper attempt to address?