Topological learning in multiclass data sets

Christopher Griffin,Trevor Karn,Benjamin Apple
DOI: https://doi.org/10.1103/physreve.109.024131
IF: 2.707
2024-02-27
Physical Review E
Abstract:We specialize techniques from topological data analysis to the problem of characterizing the topological complexity (as defined in the body of the paper) of a multiclass data set. As a by-product, a topological classifier is defined that uses an open subcovering of the data set. This subcovering can be used to construct a simplicial complex whose topological features (e.g., Betti numbers) provide information about the classification problem. We use these topological constructs to study the impact of topological complexity on learning in feedforward deep neural networks (DNNs). We hypothesize that topological complexity is negatively correlated with the ability of a fully connected feedforward deep neural network to learn to classify data correctly. We evaluate our topological classification algorithm on multiple constructed and open-source data sets. We also validate our hypothesis regarding the relationship between topological complexity and learning in DNN's on multiple data sets. https://doi.org/10.1103/PhysRevE.109.024131 ©2024 American Physical Society
physics, fluids & plasmas, mathematical
What problem does this paper attempt to address?