Abstract: In recent years, sensor technology has advanced at an unprecedented pace, and sensors have become more affordable than ever. As a result, sensor-driven data collection is becoming an attractive and practical option for researchers around the globe. Such data is typically extracted in the form of time series, which can be investigated with data mining techniques to summarize the behaviors of a range of subjects, including humans and animals. While continuous sensor recording enables cheap, large-scale data collection, it produces very large datasets that are challenging to process and analyze in a timely manner with traditional techniques. There are two main approaches to time series classification in the literature: shape-based classification and feature-based classification. Shape-based classification determines the best class according to a distance measure. Feature-based classification, on the other hand, measures properties of the time series and finds the best class according to the set of features defined for the time series. In this dissertation, we demonstrate that for some problems neither technique dominates, and that a combination of both may work best. In other words, on a single problem, one technique may be better for one subset of the behaviors and the other technique better for another subset. We introduce a hybrid algorithm that uses both shape and feature measures to classify behaviors in weakly labeled time series data collected from sensors, in order to quantify specific behaviors performed by the subject. We test the proposed algorithm on real-world datasets and demonstrate that it can robustly classify real, noisy, and complex data based on a combination of shape and features.
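To make the distinction between the two approaches concrete, the following is a minimal sketch and not the dissertation's algorithm. It assumes a one-nearest-neighbor rule with Euclidean distance for the shape-based case and, for the feature-based case, two illustrative features (mean and population standard deviation). The function names, the labels `walk` and `rest`, and the toy series are all hypothetical.

```python
import math
import statistics


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def shape_classify(query, exemplars):
    """Shape-based: return the label of the exemplar whose raw values
    are closest to the query under the distance measure."""
    return min(exemplars, key=lambda e: euclidean(query, e[1]))[0]


def features(series):
    """Illustrative feature vector (an assumption): mean and spread."""
    return (statistics.mean(series), statistics.pstdev(series))


def feature_classify(query, exemplars):
    """Feature-based: compare summary properties instead of raw shape."""
    q = features(query)
    return min(exemplars, key=lambda e: euclidean(q, features(e[1])))[0]


# Hypothetical labeled exemplars: an oscillating pattern and a flat one.
exemplars = [
    ("walk", [0, 1, 0, -1, 0, 1, 0, -1]),
    ("rest", [0, 0, 0, 0, 0, 0, 0, 0]),
]

# The same oscillation, shifted by one sample.
shifted = [1, 0, -1, 0, 1, 0, -1, 0]

print(shape_classify(shifted, exemplars))    # rest
print(feature_classify(shifted, exemplars))  # walk
```

On this toy input the two classifiers disagree: the one-sample shift puts the query farther from the `walk` exemplar than from the flat one under Euclidean distance, while its mean and spread still match `walk` exactly. This is only meant to illustrate how one technique can suit some behaviors better than the other; an elastic measure such as dynamic time warping would change the shape-based result.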
