A Distributed Multi-level Composite Index for KNN Processing on Long Time Series

Xiaqing Wang,Zicheng Fang,Peng Wang,Ruiyuan Zhu,Wei Wang
DOI: https://doi.org/10.1007/978-3-319-55753-3_14
2017-01-01
Abstract:Recently, sensor-based applications have emerged and collected plenty of long time series. Traditional whole matching similarity search can only query full length time series. However, for long time series, similarity search on arbitrary time windows is more attractive and important. In this paper, we address the problem of window-based KNN search of time series data on HBase. Based on PAA approximation, we propose a composite index structure comprising Horizontal Segment Tree and Vertical Inverted Table. VI-Table is capable to prune time series by data summary in high levels, while HS-Tree leverages data summary in low levels to reduce access of the raw time series data. Both VI-Table and HS-Tree can be built parallel and incrementally. Our experiment results show the effectiveness and robustness of the proposed approach.
What problem does this paper attempt to address?