Explainable Graph Learning for Multimodal Single-Cell Data Integration

Mehmet Burak Koca, Fatih Erdogan Sevilgen
DOI: https://doi.org/10.1101/2024.12.06.627151
2024-12-11
Abstract: Integrating multi-omic single-cell data is essential for uncovering cellular heterogeneity and identifying specialized subpopulations. However, achieving integration that is both explainable and expressive remains challenging because of the complex relationships between modalities. Here, we introduce Single-Cell PROteomics Vertical Integration (SCPRO-VI), a novel algorithm that integrates paired multi-omic data through similarity graph fusion, enhanced with a multi-view variational graph auto-encoder. SCPRO-VI combines a biologically guided distance metric with a multi-view graph-based embedding approach to capture cross-modality relations effectively. Extensive benchmarking on multi-omic CITE-seq datasets shows that SCPRO-VI significantly enhances inter-cell-type heterogeneity and identifies biologically meaningful sub-clusters that existing methods cannot distinguish. These results demonstrate the robustness of SCPRO-VI and its potential to address key challenges in single-cell multi-omic data integration.
Bioinformatics