Clustering Time Series Utilizing A Dimension Hierarchical Decomposition Approach

Qiuhong Li,Peng Wang,Yang Wang,Wei Wang,Yimin Liu,Jiaye Wu,Danyang Dou
DOI: https://doi.org/10.1007/978-3-319-55753-3_16
2017-01-01
Abstract:Time series clustering has attracted amount of attention recently. However, clustering massive time series faces the challenge of the huge computation cost. To reduce the computation cost, we propose a novel Dimension Hierarchical Decomposition (DHD for short) method to represent time series and a corresponding tree structure, denoted as DHDTree, to reorganize the time series collections to achieve the best separation effect. The main idea of DHDTree is to adapt k-d tree for time series by utilizing the DHD representation. When splitting, we select the most separable splitting strategy according to a predefined cost model. A fundamental feature of DHDTree is that it overcomes dimension curse by leveraging dimension compositions instead of selecting only one dimension when splitting, aiming to acquire the maximal separation effect. We illustrate that DHDTree obtains both the balance and the locality properties, which are important factors for the efficiency of time series organization for clustering. By the support of DHDTree, we improve clustering in two aspects. First, the DHD representation decreases the computation cost between time series dramatically. Secondly, we acquire the centers benefiting from the reorganization of the time series using our proposed DHDTree structure. Both the synthetic and real data sets verify the effectiveness and efficiency of the proposed method.
What problem does this paper attempt to address?