Data Streaming Algorithms for Accurate and Efcient Measurement of Trafc and Flow Matrices

Zhao Abhishek,Kumar Jia,Jun Wang
2005-01-01
Abstract:The trafc volume between origin/destination (OD) pairs in a network, known as trafc matrix, is essential for efcient network provisioning and trafc engineering. Existing approaches of estimating the trafc matrix, based on statistical inference and/or packet sampling, usually cannot achieve very high estimation accuracy. In this work, we take a brand new approach in attacking this problem. We propose a novel data streaming algorithm that can process trafc stream at very high speed (e.g., 40 Gbps) and produce trafc digests that are orders of magnitude smaller than the trafc stream. By correlating the digests collected at any OD pair using Bayesian statistics, the volume of trafc o wing between the OD pair can be accurately determined. We also establish principles and techniques for optimally combining this streaming method with sampling, when sampling is necessary due to stringent resource constraints. In addition, we propose another data streaming algorithm that estimates ow matrix, a ner -grained characterization than trafc matrix. Flow matrix is concerned with not only the total trafc between an OD pair (trafc matrix), but also how it splits into o ws of various sizes. Through rigorous theoretical analysis and extensive synthetic experiments on real Internet trafc, we demonstrate that these two algorithms can produce very accurate estimation of trafc matrix and o w matrix respectively.
What problem does this paper attempt to address?