A Hybrid Prediction Method for Realistic Network Traffic With Temporal Convolutional Network and LSTM

Jing Bi,Xiang Zhang,Haitao Yuan,Jia Zhang,MengChu Zhou
DOI: https://doi.org/10.1109/tase.2021.3077537
IF: 6.636
2021-01-01
IEEE Transactions on Automation Science and Engineering
Abstract:Accurate and real-time prediction of network traffic can not only help system operators allocate resources rationally according to their actual business needs but also help them assess the performance of a network and analyze its health status. In recent years, neural networks have been proved suitable to predict time series data, represented by the model of a long short-term memory (LSTM) neural network and a temporal convolutional network (TCN). This article proposes a novel hybrid prediction method named SG and TCN-based LSTM (ST-LSTM) for such network traffic prediction, which synergistically combines the power of the Savitzky–Golay (SG) filter, the TCN, as well as the LSTM. ST-LSTM employs a three-phase end-to-end methodology serving time series prediction. It first eliminates noise in raw data using the SG filter, then extracts short-term features from sequences applying the TCN, and then captures the long-term dependence in the data exploiting the LSTM. Experimental results over real-world datasets demonstrate that the proposed ST-LSTM outperforms state-of-the-art algorithms in terms of prediction accuracy. Note to Practitioners—This work considers real-time and high-accuracy prediction of network traffic. It is highly important to well predict network traffic by capturing long-term dependence and effectively extracting high- and low-frequency information from time series data. Yet, it is a big challenge to achieve it because there are unstable characteristics and strong nonlinear features in the network traffic due to continuous expansion of network scale and fast emergence of new services. Current prediction methods usually have oversimplified theoretical assumptions, need significant time and memory, or suffer problems of gradient disappearance or early convergence. Thus, they fail to effectively capture the nonlinear characteristics of large-scale network sequences. This work proposes a hybrid predic-ion method named SG and TCN-based LSTM (ST-LSTM), which integrates the merits of the Savitzky–Golay filter, the temporal convolutional network (TCN), and the long short-term memory (LSTM), serving as smoothing time series, capturing short-term local features, and capturing long-term dependence, respectively. Experimental results based on the real-life dataset demonstrate that it achieves better prediction accuracy than its state-of-the-art peers, including the TCN and the LSTM. It can be readily implemented and deployed in many real-life industrial areas including smart city, edge computing, cloud computing, and data centers.
automation & control systems
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to address the issue of real-time high-precision network traffic prediction. Specifically, the paper proposes a new hybrid prediction method (ST-LSTM) that combines the advantages of the Savitzky-Golay filter (SG filter), Temporal Convolutional Network (TCN), and Long Short-Term Memory network (LSTM) to improve the accuracy of network traffic prediction. #### Main Issues: 1. **Limitations of existing methods**: Traditional prediction methods are based on simplified theoretical assumptions and face issues such as high time consumption, large memory usage, and problems like gradient vanishing or early convergence when dealing with large-scale data. These methods fail to effectively capture the nonlinear characteristics of network traffic. 2. **Characteristics of network traffic**: With the continuous expansion of network scale and the rapid emergence of new services, network traffic exhibits instability and strong nonlinear characteristics, posing higher demands on prediction. #### Solution: - **Data preprocessing**: The original sequence is logarithmically transformed to approximate a normal distribution, and the SG filter is used to smooth the original sequence and eliminate noise. - **Two-step architecture**: First, TCN is used to extract short-term local features, and then LSTM is employed to capture long-term dependencies, ultimately achieving more accurate prediction results. #### Experimental validation: Extensive experiments on real datasets show that the proposed ST-LSTM method outperforms existing advanced algorithms in terms of prediction accuracy, including models that use TCN and LSTM individually.