A Spectral Method for Assessing and Combining Multiple Data Visualizations-Reference-Cited by-同舟云学术

A Spectral Method for Assessing and Combining Multiple Data Visualizations

Published:2022-10-27 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Ma Rong,Sun Eric D.,Zou James

Abstract

AbstractDimension reduction and data visualization aim to project a high-dimensional dataset to a low-dimensional space while capturing the intrinsic structures in the data. It is an indispensable part of modern data science, and many dimensional reduction and visualization algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it critically important to evaluate their relative performance for a given dataset, and to leverage and combine their individual strengths. In this paper, we propose an efficient spectral method for assessing and combining multiple visualizations of a given dataset produced by diverse algorithms. The proposed method provides a quantitative measure – the visualization eigenscore – of the relative performance of the visualizations for preserving the structure around each data point. Then it leverages the eigenscores to obtain a consensus visualization, which has much improved quality over the individual visualizations in capturing the underlying true data structure. Our approach is flexible and works as a wrapper around any visualizations. We analyze multiple simulated and real-world datasets from diverse applications to demonstrate the effectiveness of the eigenscores for evaluating visualizations and the superiority of the proposed consensus visualization. Furthermore, we establish rigorous theoretical justification of our method based on a general statistical framework, yielding fundamental principles behind the empirical success of consensus visualization along with practical guidance.

Publisher

Cold Spring Harbor Laboratory

Reference51 articles.

1. Abraham, I. , Y. Bartal , and O. Neiman (2006). Advances in metric embedding theory. In Proceedings of the thirty-eighth annual ACM symposium on Theory of computing, pp. 271–286.

2. Abraham, I. , Y. Bartal , and O. Neiman (2009). On low dimensional local embeddings. In Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 875–884. SIAM.

3. Arora, S. , W. Hu , and P. K. Kothari (2018). An analysis of the t-SNE algorithm for data visualization. In Conference on Learning Theory, pp. 1455–1462. PMLR.

4. Bartal, Y. , N. Fandina , and O. Neiman (2019). Dimensionality reduction: theoretical perspective on practical measures. Advances in Neural Information Processing Systems 32.

5. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation