Deterministic and Probabilistic P4-Enabled Lightweight In-Band Network Telemetry

Konstantinos Papadopoulos,Panagiotis Papadimitriou,Chrysa Papagianni
DOI: https://doi.org/10.1109/TNSM.2023.3301839
2024-04-10
Abstract:In-band network telemetry (INT), empowered by programmable dataplanes such as P4, comprises a viable approach to network monitoring and telemetry analysis. However, P4-INT as well as other existing frameworks for INT yield a substantial transmission overhead, which grows linearly with the number of hops and the number of telemetry values. To address this issue, we present a deterministic and a probabilistic technique for lightweight INT, termed as DLINT and PLINT,respectively. In particular, DLINT exercises per-flow aggregation by spreading the telemetry values across the packets of a flow. DLINT relies on switch coordination through the use of per-flow telemetry states, maintained within P4 switches. Furthermore, DLINT utilizes Bloom Filters (BF) in order to compress the state lookup tables within P4 switches. On the other hand, PLINT employs a probabilistic approach based on reservoir sampling. PLINT essentially empowers every INT node to insert telemetry values with equal probability within each packet. Our evaluation results corroborate that both proposed techniques alleviate the transmission overhead of P4-INT, while maintaining a high degree of monitoring accuracy. In addition, we perform a comparative evaluation between DLINT and PLINT. DLINT is more effective in conveying path traces to the telemetry server, whereas PLINT detects more promptly path updates exploiting its more efficient INT header space utilization
Networking and Internet Architecture
What problem does this paper attempt to address?
The paper aims to address the transmission overhead issue of In-band Network Telemetry (INT) technology in 5G and beyond network services. Specifically: 1. **Problem Background**: With the development of 5G and beyond network services, there is a need for precise network performance monitoring to timely detect and analyze application performance degradation related to network failures, link interruptions, router configuration errors, and congestion. Current network telemetry methods, such as control plane intervention or packet mirroring to servers for analysis, have significant communication overhead or additional infrastructure requirements. 2. **Limitations of Existing Technologies**: Although the P4-based INT framework can directly obtain telemetry information from the data plane, its method of encoding per-hop information leads to a linear increase in transmission overhead. This not only wastes bandwidth but also reduces throughput and increases flow completion time. 3. **Research Objectives**: This paper proposes two lightweight INT technologies—Deterministic Lightweight INT (DLINT) and Probabilistic Lightweight INT (PLINT)—aiming to alleviate the transmission overhead issues brought by traditional P4-INT. By distributing telemetry values across multiple packets, both methods can significantly reduce the amount of telemetry information required per packet, thereby improving monitoring accuracy while reducing transmission overhead. In summary, the paper is primarily dedicated to developing more efficient network telemetry technologies to achieve higher precision and lower resource consumption in network monitoring.