Matrixdcn: A High Performance Network Architecture for Large-Scale Cloud Data Centers

Yantao Sun,Min Chen,Limei Peng,Mohammad Mehedi Hassan,Abdulhameed Alelaiwi
DOI: https://doi.org/10.1002/wcm.2579
2015-01-01
Wireless Communications and Mobile Computing
Abstract:With the widespread deployment of cloud services, data center networks are developing toward large-scale, multi-path networks. Conventional switching-oriented data center network meets difficulties in terms of scalability and flexibility to support increasing bandwidth requirements for cloud services. To solve this problem, a simple and scalable architecture, MatrixDCN, is proposed in this paper. MatrixDCN is an approximate non-blocking network, in which switches and servers are arranged in rows and columns that compose a matrix structure. A MatrixDCN network can accommodate up to hundreds of thousands of servers without bandwidth bottlenecks. Furthermore, the physical topology of a MatrixDCN network can be designed consistently with its logic topology, which helps to reduce the complexity of the management and maintenance of a data center. An efficient routing algorithm, named fault-avoidance routing FAR, is well designed for MatrixDCN to fully leverage the regularity in the topology. FAR builds two routing tables for a router. A BRT is built based on local topology, and a novel negative routing table NRT is increasingly built based on learned partial network failures, which really avoids the problem of network convergence and further shortens the calculating time of routing tables. FAR also greatly reduces the size of routing tables by introducing NRTs at routers. Theoretical analysis and simulations show that MatrixDCN has advantages on the scalability of topology, network throughput, and the performance of FAR. Copyright © 2015John Wiley & Sons, Ltd.
What problem does this paper attempt to address?