SRNIC: A Scalable Architecture for RDMA NICs
Pingbo An,Kui Ji,Ke-Sheng Huo,Chuanxiong Guo,Tao Feng,Qingsong Ning,Tianhao Wang,Tao Ding,Zilong Wang,Wenxue Li,Peng Xie,K. Cheng,Bin Xu,Kai Chen,Shi-Min Zhang,Layong Luo,Chaoliang Zeng,Xinchen Wan,Ruiqing Feng,Xiongfei Geng,Weicheng Ling
Abstract:RDMA is expected to be highly scalable: to perform well in large-scale data center networks where packet losses are inevitable ( i.e. , high network scalability), and to support a large number of performant connections per server ( i.e. , high connection scalability). Commercial RoCEv2 NICs (RNICs) fall short on scalability as they rely on a lossless, limited-scale network fabric and support only a small number of performant connections. Recent work IRN improves the network scalability by relaxing the lossless network requirement, but the connection scalability issue remains unaddressed. In this paper, we aim to address the connection scalability challenge, while maintaining high performance and low CPU overhead as commercial RNICs, and high network scalability as IRN, by designing SRNIC, a Scalable RDMA NIC architecture. Our key insight in SRNIC is that, on-chip data structures and their memory requirements in RNICs can be minimized with careful protocol and architecture co-designs to improve connection scalability. Guided by this insight, we analyze all data structures involved in an RDMA conceptual model, and remove them as many as possible with RDMA protocol header modifications and architectural innovations, including cache-free QP scheduler and memory-free selective repeat. We implement a fully functional SRNIC prototype using FPGA. Experiments show that, SRNIC achieves 10K performant connections on chip and outperforms commercial RNICs by 18x in terms of normalized connection scalability ( i.e.
Computer Science,Engineering