Unsupervised curve clustering using wavelets

Umberto Amato,Anestis Antoniadis,Italia De Feis,Irène Gijbels
DOI: https://doi.org/10.1007/s11634-024-00612-7
2024-11-22
Advances in Data Analysis and Classification
Abstract:Clustering univariate functional data is mostly based on projecting the curves onto an adequate basis and applying some distance or similarity models on the coefficients. The basis functions should be chosen depending on features of the function being estimated. Commonly used are Fourier, polynomial and splines, but these may not be well suited for curves that exhibit inhomogeneous behavior. Wavelets on the contrary are well suited for identifying highly discriminant local time and scale features, and are able to adapt to the data smoothness. In recent years, few methods, relying on wavelet-based similarity measures, have been proposed for clustering curves, observed on equidistant points. In this work, we present a non-equidistant design wavelet based method for non-parametrically estimating and clustering a large number of curves. The method consists of several crucial stages: fitting functional data by non-equispaced design wavelet regression, screening out nearly flat curves, denoising the remaining curves with wavelet thresholding, and finally clustering the denoised curves. Simulation studies compare our proposed method with some other functional clustering methods. The method is applied for clustering some real functional data profiles.
statistics & probability
What problem does this paper attempt to address?