HIERVAR: A Hierarchical Feature Selection Method for Time Series Analysis

Alireza Keshavarzian,Shahrokh Valaee
2024-07-23
Abstract:Time series classification stands as a pivotal and intricate challenge across various domains, including finance, healthcare, and industrial systems. In contemporary research, there has been a notable upsurge in exploring feature extraction through random sampling. Unlike deep convolutional networks, these methods sidestep elaborate training procedures, yet they often necessitate generating a surplus of features to comprehensively encapsulate time series nuances. Consequently, some features may lack relevance to labels or exhibit multi-collinearity with others. In this paper, we propose a novel hierarchical feature selection method aided by ANOVA variance analysis to address this challenge. Through meticulous experimentation, we demonstrate that our method substantially reduces features by over 94% while preserving accuracy -- a significant advancement in the field of time series analysis and feature selection.
Machine Learning,Information Theory
What problem does this paper attempt to address?
This paper aims to address the issue of feature redundancy in time series classification. Specifically, the paper proposes a new hierarchical feature selection method (HIERV AR), which reduces the number of random features by combining ANOVA variance analysis, thereby improving the efficiency and accuracy of the model. This method can significantly reduce the number of features while retaining classification accuracy, especially when dealing with a large number of randomly generated features. Experimental results show that HIERV AR can reduce the number of features by more than 94% and outperforms other feature selection techniques on multiple benchmark datasets. Additionally, this method enhances the interpretability of the model and reduces the risk of overfitting.