An Accelerated U-Shapelet Time Series Clustering Method with LSH

Linghao Luo, Shu Lv
DOI: https://doi.org/10.1088/1742-6596/1631/1/012077
2020-09-01
Journal of Physics: Conference Series
Abstract: Traditional time series clustering measures similarity over the whole time series and applies a conventional clustering method directly. Only recently did Keogh introduce the concept of the shapelet, which he later applied to unsupervised time series clustering. The result is a time series clustering method based on subsequence features, with good clustering accuracy and stability. However, because it must traverse all subsequences, the original brute-force search algorithm is computationally expensive and difficult to apply to large time series data sets. In this paper, building on the original method, we introduce the idea of data retrieval in the data preprocessing stage to quickly screen candidate subsequences. We then introduce locality-sensitive hashing (LSH), which is used to match the subsequences of the preprocessed time series data, and candidates are selected on the basis that a high-quality u-Shapelet is widely representative. We therefore propose LSH-us to speed up the extraction of feature subsequences. Experimental results show that the algorithm improves the speed of u-Shapelet time series clustering while maintaining clustering accuracy.
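The matching step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the hash family, the parameters, or the selection rule used by LSH-us, so everything below is an assumption made for illustration: subsequences are z-normalized, hashed with random-hyperplane (sign-of-projection) LSH, and grouped into buckets, and the number of distinct series in a bucket serves as a crude stand-in for "wide representativeness". All function names are hypothetical.

```python
import math
import random
from collections import defaultdict


def z_normalize(window):
    """Rescale a window to zero mean and unit variance (constant windows become zeros)."""
    n = len(window)
    mean = sum(window) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in window) / n)
    if std == 0:
        return [0.0] * n
    return [(x - mean) / std for x in window]


def sliding_windows(series, length):
    """Yield (start, z-normalized subsequence) for every window of the given length."""
    for start in range(len(series) - length + 1):
        yield start, z_normalize(series[start:start + length])


def make_hyperplanes(num_bits, length, seed=0):
    """Draw num_bits random Gaussian vectors (assumed hash family, not from the paper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(length)] for _ in range(num_bits)]


def lsh_signature(vector, hyperplanes):
    """One bit per hyperplane: 1 if the projection is non-negative, else 0."""
    return tuple(
        1 if sum(h * v for h, v in zip(plane, vector)) >= 0 else 0
        for plane in hyperplanes
    )


def bucket_subsequences(dataset, length, num_bits=8, seed=0):
    """Group all subsequences of one length by LSH signature.

    Returns a dict mapping signature -> list of (series_id, start) pairs.
    """
    hyperplanes = make_hyperplanes(num_bits, length, seed)
    buckets = defaultdict(list)
    for series_id, series in enumerate(dataset):
        for start, window in sliding_windows(series, length):
            buckets[lsh_signature(window, hyperplanes)].append((series_id, start))
    return buckets


def series_coverage(members):
    """Number of distinct series in a bucket (illustrative proxy for representativeness)."""
    return len({series_id for series_id, _ in members})


if __name__ == "__main__":
    # Toy data: the pattern 0,1,2,1,0 appears in both series.
    dataset = [
        [0, 1, 2, 1, 0, 1, 2, 1, 0],
        [5, 5, 0, 1, 2, 1, 0, 5],
    ]
    buckets = bucket_subsequences(dataset, length=5)
    # Rank buckets so that those spanning the most series come first.
    for signature, members in sorted(
        buckets.items(), key=lambda kv: series_coverage(kv[1]), reverse=True
    ):
        print(signature, series_coverage(members), members)
```

Subsequences that are identical after z-normalization always share a signature, and similar ones collide with high probability, so candidates are compared only within a bucket rather than against every other subsequence. A real implementation would typically use several hash tables and verify bucket members with an exact distance computation.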