Learning-aided Stochastic Network Optimization with Imperfect State Prediction.
Longbo Huang,Minghua Chen,Yunxin Liu
DOI: https://doi.org/10.1109/tnet.2018.2854593
2018-01-01
IEEE/ACM Transactions on Networking
Abstract:We investigate the problem of stochastic network optimization in the presence of imperfect state prediction and non-stationarity. Based on a novel distribution-accuracy curve prediction model, we develop the predictive learning-aided control (PLC) algorithm, which jointly utilizes historic and predicted network state information for decision making. PLC is an online algorithm that requires zero a-prior system statistical information, and consists of three key components, namely sequential distribution estimation and change detection, dual learning, and online queue-based control.Specifically, we show that PLC simultaneously achieves good long-term performance, short-term queue size reduction, accurate change detection, and fast algorithm convergence. In particular, for stationary networks, PLC achieves a near-optimal [O(epsilon), O( log(1/epsilon)(2))] utility-delay tradeoff. For non-stationary networks, PLC obtains an [O(epsilon), O(log(2) (1/epsilon) + min(epsilon(c/2-1), e(w)/epsilon))] utility-backlog tradeo. for distributions that last Theta(max(epsilon(-c),e(w)(-2))/epsilon(1+a)) time, where e(w) is the prediction accuracy and a = Theta(1) > 0 is a constant (the Backpressue algorithm [1] requires an O(epsilon(-2)) length for the same utility performance with a larger backlog). Moreover, PLC detects distribution change O(w) slots faster with high probability (w is the prediction size) and achieves an O(min(epsilon(-1+c/2), e(w)/epsilon) + log(2) (1/epsilon)) convergence time, which is faster than Backpressure and other algorithms. Our results demonstrate that state prediction (even imperfect) can help (i) achieve faster detection and convergence, and (ii) obtain be.er utility-delay tradeo.s..ey also quantify the bene.ts of prediction in four important performance metrics, i.e., utility (e.ciency), delay (quality-of-service), detection (robustness), and convergence (adaptability), and provide new insight for joint prediction, learning and optimization in stochastic networks.