Cluster-based multidimensional scaling embedding tool for data visualization

Patricia Hernández-León, Miguel A. Caro
DOI: https://doi.org/10.1088/1402-4896/ad432e
2024-05-24
Abstract: We present a new technique for visualizing high-dimensional data called cluster MDS (cl-MDS), which addresses a common difficulty of dimensionality reduction methods: preserving both local and global structures of the original sample in a single 2-dimensional visualization. Its algorithm combines the well-known multidimensional scaling (MDS) tool with the $k$-medoids data clustering technique, and enables hierarchical embedding, sparsification and estimation of 2-dimensional coordinates for additional points. While cl-MDS is a generally applicable tool, we also include specific recipes for atomic structure applications. We apply this method to non-linear data of increasing complexity where different layers of locality are relevant, showing a clear improvement in their retrieval and visualization quality.
Graphics, Materials Science
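
The combination of $k$-medoids clustering and MDS described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes Euclidean distances, uses classical (Torgerson) MDS in place of whichever MDS variant the paper uses, runs a simple alternating $k$-medoids, and places each cluster's local map on the global medoid map by plain translation only. The hierarchical embedding, sparsification, and out-of-sample estimation mentioned in the abstract are omitted, and all function names (`classical_mds`, `k_medoids`, `cluster_mds_sketch`) are hypothetical.

```python
import numpy as np


def classical_mds(D, dim=2):
    """Classical (Torgerson) MDS of a square distance matrix D."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n          # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                  # double-centered Gram matrix
    w, v = np.linalg.eigh(B)                     # eigenvalues in ascending order
    idx = np.argsort(w)[::-1][:dim]              # keep the largest ones
    X = v[:, idx] * np.sqrt(np.clip(w[idx], 0.0, None))
    if X.shape[1] < dim:                         # pad when n < dim
        X = np.hstack([X, np.zeros((n, dim - X.shape[1]))])
    return X


def k_medoids(D, k, n_iter=100, seed=0):
    """Simple alternating k-medoids on a precomputed distance matrix."""
    rng = np.random.default_rng(seed)
    medoids = rng.choice(D.shape[0], size=k, replace=False)
    for _ in range(n_iter):
        labels = np.argmin(D[:, medoids], axis=1)
        new = medoids.copy()
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size:
                # New medoid: member with the smallest total distance to the rest
                within = D[np.ix_(members, members)].sum(axis=1)
                new[c] = members[np.argmin(within)]
        if np.array_equal(new, medoids):
            break
        medoids = new
    labels = np.argmin(D[:, medoids], axis=1)
    return medoids, labels


def cluster_mds_sketch(X, k=3, seed=0):
    """Local MDS per cluster, positioned on a global MDS map of the medoids.

    Assumption: local maps are only translated onto their medoid's global
    position; no rotation or scaling is applied to align them.
    """
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    medoids, labels = k_medoids(D, k, seed=seed)
    global_xy = classical_mds(D[np.ix_(medoids, medoids)])  # global structure
    Y = np.zeros((X.shape[0], 2))
    for c in range(k):
        members = np.flatnonzero(labels == c)
        local = classical_mds(D[np.ix_(members, members)])  # local structure
        m = np.flatnonzero(members == medoids[c])[0]        # medoid's row
        Y[members] = local - local[m] + global_xy[c]
    return Y, labels, medoids
```

Calling `cluster_mds_sketch(X, k=3)` on an `(n_samples, n_features)` array returns the 2-dimensional coordinates, the cluster labels, and the medoid indices. Working from a precomputed distance matrix inside the helpers keeps the sketch adaptable to non-Euclidean dissimilarities, such as those used for atomic structure descriptors.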