Enhancing Time Series Clustering by Incorporating Multiple Distance Measures with Semi-Supervised Learning

Jing Zhou,Shan-Feng Zhu,Xiaodi Huang,Yanchun Zhang
DOI: https://doi.org/10.1007/s11390-015-1565-7
IF: 1.871
2015-01-01
Journal of Computer Science and Technology
Abstract:Time series clustering is widely applied in various areas. Existing researches focus mainly on distance measures between two time series, such as dynamic time warping (DTW) based methods, edit-distance based methods, and shapelets-based methods. In this work, we experimentally demonstrate, for the first time, that no single distance measure performs significantly better than others on clustering datasets of time series where spectral clustering is used. As such, a question arises as to how to choose an appropriate measure for a given dataset of time series. To answer this question, we propose an integration scheme that incorporates multiple distance measures using semi-supervised clustering. Our approach is able to integrate all the measures by extracting valuable underlying information for the clustering. To the best of our knowledge, this work demonstrates for the first time that the semi-supervised clustering method based on constraints is able to enhance time series clustering by combining multiple distance measures. Having tested on clustering various time series datasets, we show that our method outperforms individual measures, as well as typical integration approaches.
What problem does this paper attempt to address?