Wavelet-Based Density Estimation for Persistent Homology

Konstantin Häberle,Barbara Bravi,Anthea Monod
2024-04-23
Abstract:Persistent homology is a central methodology in topological data analysis that has been successfully implemented in many fields and is becoming increasingly popular and relevant. The output of persistent homology is a persistence diagram -- a multiset of points supported on the upper half plane -- that is often used as a statistical summary of the topological features of data. In this paper, we study the random nature of persistent homology and estimate the density of expected persistence diagrams from observations using wavelets; we show that our wavelet-based estimator is optimal. Furthermore, we propose an estimator that offers a sparse representation of the expected persistence diagram that achieves near-optimality. We demonstrate the utility of our contributions in a machine learning task in the context of dynamical systems.
Statistics Theory
What problem does this paper attempt to address?
The paper investigates the use of wavelet estimation methods to study and estimate the density of expected persistence diagrams in persistent homology. Persistent homology is a key tool in topological data analysis for analyzing the topological features of data. The paper points out that due to the randomness of persistence diagrams, non-parametric density estimation methods are needed to understand their distribution behavior. The authors propose an estimator based on the Haar wavelet and prove that this estimator is minimax under certain assumptions. In addition, they also propose a sparse representation threshold Haar wavelet estimator, which has computational advantages in practical applications and approaches the optimal minimax rate. The paper verifies these theories through numerical experiments and applies them to machine learning tasks in dynamic systems.