Terra: Low-latency and Reliable Event Collection in Network Measurement
Hongyan Liu, Xi Sun, Xiang Chen, Qun Huang, Dong Zhang, Haifeng Zhou, Chunming Wu, Xuan Liu, Muhammad Khurram Khan
DOI: https://doi.org/10.1016/j.jnca.2024.103904
IF: 7.574
2024-01-01
Journal of Network and Computer Applications
Abstract: Modern network measurement employs several measurement points in the substrate network. These points perform measurement tasks to measure traffic and report real-time events to monitoring servers in the control plane. These servers convert events to flow statistics and report them to network management applications, which require both low latency (i.e., collecting events within a limited time deadline) and high reliability (i.e., bounding the probability of event loss). However, existing solutions fail to satisfy these two requirements because they ignore transmission latency and reliability when collecting events from measurement points at monitoring servers. In this paper, we propose Terra, a framework that aims to offer low-latency and reliable event collection. Terra provides a near-optimal heuristic that guides each measurement point in selecting the monitoring servers that receive its events and the corresponding network paths for transferring those events with timeliness and reliability guarantees. We have implemented Terra on a 64 × 100 Gbps Tofino-based switch. We conduct testbed experiments to demonstrate the effectiveness of Terra in practice. The results indicate that Terra offers low-latency and reliable event collection while preserving high application-level accuracy.
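
To make the two requirements named in the abstract concrete, the following is a minimal illustrative sketch of how a measurement point might filter candidate (server, path) pairs by a latency deadline and a loss-probability bound. It is not Terra's heuristic, which the abstract does not detail. The `Path` type, its fields, the function names, and the tie-breaking rule (prefer lower loss, then lower latency) are all assumptions introduced here for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Path:
    """Hypothetical candidate path from a measurement point to a server."""
    server: str        # assumed monitoring-server identifier
    latency_ms: float  # assumed end-to-end transmission latency of the path
    loss_prob: float   # assumed probability that an event is lost on the path


def is_feasible(path: Path, deadline_ms: float, max_loss: float) -> bool:
    # Low latency: the event must arrive within the deadline.
    # High reliability: the loss probability must stay within the bound.
    return path.latency_ms <= deadline_ms and path.loss_prob <= max_loss


def select_path(candidates: List[Path], deadline_ms: float,
                max_loss: float) -> Optional[Path]:
    """Pick one feasible path, or None if no candidate meets both bounds.

    The preference order below is an assumption made for this sketch,
    not a rule taken from the paper.
    """
    feasible = [p for p in candidates if is_feasible(p, deadline_ms, max_loss)]
    if not feasible:
        return None
    return min(feasible, key=lambda p: (p.loss_prob, p.latency_ms))


if __name__ == "__main__":
    # Made-up candidate values, used only to exercise the functions.
    candidates = [
        Path("s1", latency_ms=2.0, loss_prob=0.05),
        Path("s2", latency_ms=4.0, loss_prob=0.001),
        Path("s3", latency_ms=9.0, loss_prob=0.0001),
    ]
    print(select_path(candidates, deadline_ms=5.0, max_loss=0.01))
```

In this sketch a path is rejected as soon as it violates either bound, so a fast but lossy path and a reliable but slow path are both excluded. A real system would also have to account for server capacity and shared links, which this sketch ignores.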