An Interpretable Data Embedding under Uncertain Distance Information.

Nikolaos M. Freris, Michalis Vlachos, Ahmad Ajalloeian
DOI: https://doi.org/10.1109/icdm50108.2020.00119
2020-01-01
Abstract: A common assumption in embedding methodologies is the availability of exact pairwise distances. In this paper, we propose a 2D embedding that overcomes this limitation: it can operate on distances that are represented as a range of lower and upper bounds. Such bounds are typically available when objects are compressed, so our approach is highly applicable to big compressed datasets. We establish linear convergence (i.e., exponential decay of the distance to optimality) for the proposed scheme, with a rate characterized by the topology of the data graph. We compare with prevalent embedding methodologies (ISOMAP, t-SNE, MDS) and illustrate that our approach can faithfully preserve distances, correlations, and object ranks, even in the presence of inexact distance information.
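To make the idea of embedding under lower and upper distance bounds concrete, the following is a minimal sketch and not the scheme proposed in the paper. It assumes a simple squared-hinge penalty that is zero whenever an embedded distance falls inside its interval, minimized by plain gradient descent. The function names (`interval_violation`, `embed_2d`) and the parameters `steps`, `lr`, and `seed` are hypothetical choices made for illustration; the abstract does not specify the objective or the update rule.

```python
import numpy as np


def pairwise_distances(X):
    """Euclidean distance matrix for the rows of X (shape n x 2)."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def interval_violation(X, lower, upper):
    """Sum over pairs i < j of the squared amount by which the embedded
    distance falls below lower[i, j] or exceeds upper[i, j].
    Assumed penalty for illustration, not the paper's objective."""
    D = pairwise_distances(X)
    below = np.clip(lower - D, 0.0, None)
    above = np.clip(D - upper, 0.0, None)
    iu = np.triu_indices(len(X), k=1)
    return float((below[iu] ** 2 + above[iu] ** 2).sum())


def embed_2d(lower, upper, steps=500, lr=0.05, seed=0):
    """Plain gradient descent on interval_violation from a random start.
    Hypothetical helper: step count and learning rate are arbitrary."""
    rng = np.random.default_rng(seed)
    n = lower.shape[0]
    X = rng.normal(size=(n, 2))
    for _ in range(steps):
        diff = X[:, None, :] - X[None, :, :]
        D = np.sqrt((diff ** 2).sum(axis=-1))
        np.fill_diagonal(D, 1.0)  # avoid division by zero on the diagonal
        # Signed residual: positive above the upper bound, negative below the lower.
        resid = np.clip(D - upper, 0.0, None) - np.clip(lower - D, 0.0, None)
        np.fill_diagonal(resid, 0.0)
        grad = 2.0 * ((resid / D)[:, :, None] * diff).sum(axis=1)
        X -= lr * grad
    return X


# Two points at distance 5: inside [4, 6] there is no penalty.
X = np.array([[0.0, 0.0], [3.0, 4.0]])
lower = np.array([[0.0, 4.0], [4.0, 0.0]])
upper = np.array([[0.0, 6.0], [6.0, 0.0]])
print(interval_violation(X, lower, upper))
```

A penalty of this form treats every distance inside its interval as equally acceptable, which is one natural way to use bounds rather than exact values; the paper's actual formulation and its convergence analysis may differ.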