FIDE: Fast and Interpretable 2D Embedding with Correlation, Distance, and Rank Considerations.

Nikolaos M. Freris,Chuhan Yang,Michalis Vlachos
DOI: https://doi.org/10.1109/bigcom51056.2020.00014
2020-01-01
Abstract:FIDE is a method for embedding high-dimensional datasets on the Euclidean plane that seeks not only to preserve pairwise distances and correlations but also to provide an easily interpretable embedding on two dimensions. To enhance the interpretability of the resulting embedding we assume (but not require) the presence of rank or importance for each object. This information can be used to guide the embedding in such a way that more important objects are placed "higher" in one of the embedding dimensions. To solve this multi-objective problem, we formalize the preservation of distances/correlations and ranks through a linear system of equations. We present a randomized iterative method for solving the linear system in the least-squares sense. The method is distributed, in that only information pertaining to neighboring data points is used at each iteration. We compare our method with other prevalent embedding and visualization techniques (t-SNE, LLE, ISOMAP, MDS, UMAP) on real-life datasets from a wide range of application domains and showcase the merits of our approach in terms of accurate relationship preservation, particularly with respect to the high interpretability of the resulting embedding.
What problem does this paper attempt to address?