Hybrid PDES Simulation of HPC Networks Using Zombie Packets
Elkin Cruz-Camacho,Kevin Brown,Xin Wang,Xiongxiao Xu,Kai Shu,Zhiling Lan,Bob Ross,Christopher Carothers
DOI: https://doi.org/10.1145/3682060
IF: 0.851
2024-07-30
ACM Transactions on Modeling and Computer Simulation
Abstract:Although high-fidelity network simulations have proven to be reliable and cost-effective tools to peer into architectural questions for high-performance computing (HPC) networks, they incur a high resource cost. The time spent in simulating a single millisecond of network traffic in the highest detail can take hours, even for static, well-behaved traffic patterns such as uniform random. Surrogate models offer a significant reduction in runtime, yet they cannot serve as complete replacements and should only be used when appropriate. Thus, there is a need for hybrid modeling, where high-fidelity simulation and surrogates run side-by-side. We present a surrogate model for HPC networks in which: packets bypass the network, while the network state is left untouched, i.e , suspended. To bypass the network, we use historical data to estimate the arrival time at which every packet should be scheduled at; to suspend the network, all in-flight packets are scheduled to arrive at their destinations, and are kept in the system to awaken as zombies when switching back to high-fidelity. Speedup for a hybrid model is relative to the proportion of surrogate to high-fidelity. This light-weight surrogate obtained up to 76 × speedup. Keeping the zombies in the network showed an increase in the accuracy of the high-fidelity simulation on restart when compared to restarting the network from an empty state.
computer science, interdisciplinary applications,mathematics, applied