WebAtlas pipeline for integrated single-cell and spatial transcriptomic data

Tong Li,David Horsfall,Daniela Basurto-Lozada,Kenny Roberts,Martin Prete,John E. G. Lawrence,Peng He,Elisabeth Tuck,Josh Moore,Aybuke Kupcu Yoldas,Kolawole Babalola,Matthew Hartley,Shila Ghazanfar,Sarah A. Teichmann,Muzlifah Haniffa,Omer Ali Bayraktar
DOI: https://doi.org/10.1038/s41592-024-02371-x
IF: 48
2024-08-21
Nature Methods
Abstract:Multimodal tissue atlasing datasets pose two key challenges for online dissemination and equitable access. First, single-cell RNA-sequencing (scRNA-seq) and spatial transcriptomics data objects are often saved in non-unified sequencing and imaging file formats that perform poorly with web technologies. Second, existing software platforms do not readily support simultaneous browsing of multiple integrated data modalities. To address these challenges, we provide 1) a new data ingestion pipeline to convert and unify datasets from multiple single-cell and spatial technologies into the cloud-ready Zarr format 1 (Fig. 1b) and 2) a front-end web client based on the Vitessce framework 2 for interactive exploration and cross-query of gene expression and cell types across modalities (Fig. 1d). WebAtlas allows bioinformaticians and software engineers to build public-facing data portals, as well as non-technical community members to access tissue atlases. (See Supplementary Note 1 for detailed comparison to other platforms.)
biochemical research methods
What problem does this paper attempt to address?