Abstract:Memory disaggregation is a promising architecture for modern datacenters that separates compute and memory resources into independent pools connected by ultra-fast networks, which can improve memory utilization, reduce cost, and enable elastic scaling of compute and memory resources. However, existing memory disaggregation solutions based on remote direct memory access (RDMA) suffer from high latency and additional overheads including page faults and code refactoring. Emerging cache-coherent interconnects such as CXL offer opportunities to reconstruct high-performance memory disaggregation. However, existing CXL-based approaches have physical distance limitation and cannot be deployed across racks. In this article, we propose Rcmp, a novel low-latency and highly scalable memory disaggregation system based on RDMA and CXL. The significant feature is that Rcmp improves the performance of RDMA-based systems via CXL, and leverages RDMA to overcome CXL’s distance limitation. To address the challenges of the mismatch between RDMA and CXL in terms of granularity, communication, and performance, Rcmp (1) provides a global page-based memory space management and enables fine-grained data access, (2) designs an efficient communication mechanism to avoid communication blocking issues, (3) proposes a hot-page identification and swapping strategy to reduce RDMA communications, and (4) designs an RDMA-optimized RPC framework to accelerate RDMA transfers. We implement a prototype of Rcmp and evaluate its performance by using micro-benchmarks and running a key-value store with YCSB benchmarks. The results show that Rcmp can achieve 5.2× lower latency and 3.8× higher throughput than RDMA-based systems. We also demonstrate that Rcmp can scale well with the increasing number of nodes without compromising performance.

POSTER: CAVER: Enhancing RDMA Load Balancing by Hunting Less-Congested Paths

RDMA Load Balancing via Data Partition

SeqBalance: Congestion-Aware Load Balancing with no Reordering for RoCE

Achieving Low Latency for Multipath Transmission in RDMA Based Data Center Network

Maximizing the Benefit of RDMA at End Hosts

RB2: Narrow the Gap Between RDMA Abstraction and Performance Via a Middle Layer

RDMAvisor: Toward Deploying Scalable and Simple RDMA as a Service in Datacenters

A Survey of Storage Systems in the RDMA Era

LSCC: Link-Segmented Congestion Control for RDMA in Cross-Datacenter Networks

Network Load Balancing with In-network Reordering Support for RDMA

An efficient cloud-based elastic RDMA protocol for HPC applications

Toward Effective and Fair RDMA Resource Sharing.

Rcmp: Reconstructing RDMA-Based Memory Disaggregation via CXL

L2BM: Switch Buffer Management for Hybrid Traffic in Data Center Networks

RF-RPC: Remote Fetching RPC Paradigm for RDMA-Enabled Network

A Comprehensive Evaluation of RDMA-enabled Concurrency Control Protocols.

MC-RDMA: Improving Replication Performance of RDMA-based Distributed Systems with Reliable Multicast Support

Dart: Divide and Specialize for Fast Response to Congestion in RDMA-Based Datacenter Networks

RFP: When RPC is Faster than Server-Bypass with RDMA.

Lightning: A Practical Building Block for RDMA Transport Control

Error Recovery of RDMA Packets in Data Center Networks