Compound-SNE: Comparative alignment of t-SNEs for multiple single-cell omics data visualisation

Colin G Cess,Laleh Haghverdi
DOI: https://doi.org/10.1093/bioinformatics/btae471
IF: 5.8
2024-07-25
Bioinformatics
Abstract:Abstract Summary One of the first steps in single-cell omics data analysis is visualization, which allows researchers to see how well-separated cell-types are from each other. When visualizing multiple datasets at once, data integration/batch correction methods are used to merge the datasets. While needed for downstream analyses, these methods modify features space (e.g. gene expression)/PCA space in order to mix cell-types between batches as well as possible. This obscures sample-specific features and breaks down local embedding structures that can be seen when a sample is embedded alone. Therefore, in order to improve in visual comparisons between large numbers of samples (e.g., multiple patients, omic modalities, different time points), we introduce Compound-SNE, which performs what we term a soft alignment of samples in embedding space. We show that Compound-SNE is able to align cell-types in embedding space across samples, while preserving local embedding structures from when samples are embedded independently. Availability and Implementation Python code for Compound-SNE is available for download at https://github.com/HaghverdiLab/Compound-SNE. Supplementary Information Available online. Provides algorithmic details and additional tests.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?