Abstract: Emerging reconfigurable data centers introduce unprecedented flexibility in how the physical layer can be programmed to adapt to current traffic demands. These reconfigurable topologies are commonly hybrid, consisting of static and reconfigurable links, enabled, e.g., by an Optical Circuit Switch (OCS) connected to top-of-rack switches in Clos networks. Even though prior work has showcased the practical benefits of hybrid networks, several crucial performance aspects are not well understood. For example, many systems enforce an artificial segregation of the static and reconfigurable parts of the hybrid network, leaving potential performance gains unused. In this paper, we study the algorithmic problem of how to jointly optimize topology and routing in reconfigurable data centers in order to optimize a fundamental metric, the maximum link load. The complexity of reconfiguration mechanisms in this space is largely unexplored, especially for the following cross-layer network-design problem: given a hybrid network and a traffic matrix, jointly design the physical layer and the flow routing in order to minimize the maximum link load. We chart the corresponding algorithmic landscape, investigating both splittable and unsplittable flows as well as segregated and non-segregated routing policies. A topological complexity classification of the problem reveals NP-hardness in general for network topologies that are trees of depth at least two, in contrast to the tractability on trees of depth one. We moreover prove that the problem is not submodular under any of these routing policies, even in multi-layer trees. However, networks that can be abstracted by a single packet switch (e.g., nonblocking Fat-Tree topologies) can be optimized efficiently, and we present optimal polynomial-time algorithms accordingly. We complement our theoretical results with trace-driven simulation studies, in which our algorithms can significantly improve the network load in comparison to the state of the art.
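
To make the problem statement concrete, the following is a minimal sketch of the objective on a toy instance. It is not the paper's algorithm. It assumes a hypothetical model chosen for illustration: every node has one static link to a single packet switch, the reconfigurable links form a matching between nodes, demands are symmetric and unsplittable, and the demand of a matched pair is routed entirely over its direct link. The names `matchings`, `max_link_load`, and `best_matching` are illustrative, and the search is exhaustive, so it only works for a handful of nodes.

```python
def matchings(nodes):
    """Yield every (possibly partial) matching on `nodes` as a list of pairs."""
    nodes = list(nodes)
    if len(nodes) < 2:
        yield []
        return
    first, rest = nodes[0], nodes[1:]
    # Case 1: `first` stays unmatched.
    yield from matchings(rest)
    # Case 2: `first` is matched with one of the remaining nodes.
    for k, partner in enumerate(rest):
        remaining = rest[:k] + rest[k + 1:]
        for m in matchings(remaining):
            yield [(first, partner)] + m


def max_link_load(demand, matching):
    """Maximum load over all links for a given matching (toy model).

    demand[i][j] with i < j is the traffic between nodes i and j.
    A matched pair sends its demand over its direct reconfigurable link;
    every other demand crosses the static links of both endpoints.
    """
    n = len(demand)
    direct = set(matching)
    static_load = [0.0] * n   # load on the static link of each node
    worst_direct = 0.0        # largest load on any reconfigurable link
    for i in range(n):
        for j in range(i + 1, n):
            d = demand[i][j]
            if (i, j) in direct:
                worst_direct = max(worst_direct, d)
            else:
                static_load[i] += d
                static_load[j] += d
    return max(worst_direct, max(static_load))


def best_matching(demand):
    """Exhaustively pick a matching that minimizes the maximum link load."""
    n = len(demand)
    candidates = ((max_link_load(demand, m), m) for m in matchings(range(n)))
    return min(candidates, key=lambda pair: pair[0])


if __name__ == "__main__":
    # Hypothetical 4-node traffic matrix (only the upper triangle is read).
    demand = [
        [0, 10, 1, 0],
        [0, 0, 0, 1],
        [0, 0, 0, 8],
        [0, 0, 0, 0],
    ]
    load, matching = best_matching(demand)
    print("max link load:", load, "matching:", matching)
```

The exhaustive enumeration only illustrates what "jointly choosing topology and routing" means on a depth-one tree; the polynomial-time algorithms and hardness results mentioned in the abstract concern the general problem and are not reproduced here.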

A Lightweight Routing Layer Using a Reliable Link-Layer Protocol

RFT: Toward Highly Reliable Flow Data Transmission in Network Measurement

RIFFLE: A Distributed Network Operating System with Re-configurability for Multiple Domains

A Longitudinal Heterogeneous Scheme Based on Intelligent Resilient Framework

Minimizing Buffer Utilization for Lossless Inter-DC Links

Survey on Link Layer Congestion Management of Lossless Switching Fabric

Lightning: A Practical Building Block for RDMA Transport Control

SRNIC: A Scalable Architecture for RDMA NICs

Switch-Assistant Loss Recovery for RDMA Transport Control

A Fast RDMA Offload Method for Unreliable Interconnection Networks

RoUD: Scalable RDMA over UD in Lossy Data Center Networks

NegotiaToR: Towards A Simple Yet Effective On-demand Reconfigurable Datacenter Network

Bifrost: Extending RoCE for Long Distance Inter-DC Links

Towards connection-scalable RNIC architecture

Using Direct Interconnection Networks For Load-Balanced Architecture

Anole: Scheduling Flows for Fast Datacenter Networks with Packet Re-Prioritization

A Flit-Level Speedup Scheme for Network-on-chips Using Self-Reconfigurable Bi-Directional Channels

Load-Optimization in Reconfigurable Data-Center Networks: Algorithms and Complexity of Flow Routing

Re-architecting Congestion Management in Lossless Ethernet

LEFT: LightwEight and FasT Packet Reordering for RDMA

A Physical-Layer Collision Awared All-Optical Time Slice Routing Optimization Method for High Reliable Low-Latency Communication in Transmission and Computing Resource Integration Networks