Parallel Time Series Decomposition Algorithm Based on Spark

Yong LI,Fei TENG,Qichuan HUANG,Tianrui LI
DOI: https://doi.org/10.3778/j.issn.1673-9418.1705036
2018-01-01
Abstract:This paper proposes a parallel time series decomposition model based on Spark to handle the challenges of time series decomposition in the era of big data. Specifically, the parallel algorithm consists of three steps: Firstly, a time series is split into a sequence of sub-time series while adding some redundant data into the sub-time series to protect these sub-time series being polluted. Secondly, each segmented sub-time series (with redundant information) is transferred to one work node on Spark for analyzing by using some time series decomposition algorithms. Thirdly, the redundant parts in the analysis result of each sub-time series are removed and the results are integrated. Experi-ments are conducted to evaluate the model, the results demonstrate its effectiveness and accuracy on large-scale time series datasets.
What problem does this paper attempt to address?