HUMAP: Hierarchical Uniform Manifold Approximation and Projection

Wilson E. Marcílio-Jr,Danilo M. Eler,Fernando V. Paulovich,Rafael M. Martins
DOI: https://doi.org/10.1109/TVCG.2024.3471181
2024-10-01
Abstract:Dimensionality reduction (DR) techniques help analysts to understand patterns in high-dimensional spaces. These techniques, often represented by scatter plots, are employed in diverse science domains and facilitate similarity analysis among clusters and data samples. For datasets containing many granularities or when analysis follows the information visualization mantra, hierarchical DR techniques are the most suitable approach since they present major structures beforehand and details on demand. This work presents HUMAP, a novel hierarchical dimensionality reduction technique designed to be flexible on preserving local and global structures and preserve the mental map throughout hierarchical exploration. We provide empirical evidence of our technique's superiority compared with current hierarchical approaches and show a case study applying HUMAP for dataset labelling.
Machine Learning,Graphics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the limitations of traditional dimensionality reduction techniques when dealing with high - dimensional data sets, especially when the data set contains multiple granularity levels or the analysis needs to follow the principles of information visualization. Specifically, traditional dimensionality reduction techniques usually operate at only one level of detail, focusing on providing an overall view to describe the entire data set while ignoring the fine - grained details of the intra - cluster distribution. This has led to the important differences within the clusters being hidden, affecting a deeper understanding and analysis of the data. To solve these problems, the paper proposes HUMAP (Hierarchical Uniform Manifold Approximation and Projection), which is a new hierarchical dimensionality reduction technique. HUMAP aims to flexibly preserve the local and global structures and maintain the mental map throughout the hierarchical exploration process, that is, the user can maintain the cognitive consistency of the data structure when navigating between different levels. In this way, HUMAP can not only reveal the complex structures existing in the data set but also maintain the coherence of these structures when the user deeply explores the hierarchy, thus providing a better solution than the existing hierarchical dimensionality reduction methods.