Spatial Deconvolution of Cell Types and Cell States at Scale Utilizing TACIT

Khoa Huynh,Katarzyna M. Tyc,Bruno F. Matuck,Quinn T Easter,Aditya Pratapa,Nikhil V. Kumar,Paola Perez,Rachel Kulchar,Thomas Pranzatelli,Deiziane Souza,Theresa M. Weaver,Xufeng Qu,Luiz Alberto Valente Soares Junior,Marisa Dolhnokoff,David E. Kleiner,Stephen M. Hewitt,Luiz Fernando Ferraz da Silva,Vanderson Rocha,Blake M. Warner,Kevin M. Byrd,Jinze Liu
DOI: https://doi.org/10.1101/2024.05.31.596861
2024-06-03
Abstract:Identifying cell types and states remains a time-consuming and error-prone challenge for spatial biology. While deep learning is increasingly used, it is difficult to generalize due to variability at the level of cells, neighborhoods, and niches in health and disease. To address this, we developed TACIT, an unsupervised algorithm for cell annotation using predefined signatures that operates without training data, using unbiased thresholding to distinguish positive cells from background, focusing on relevant markers to identify ambiguous cells in multiomic assays. Using five datasets (5,000,000-cells; 51-cell types) from three niches (brain, intestine, gland), TACIT outperformed existing unsupervised methods in accuracy and scalability. Integration of TACIT-identified cell with a novel Shiny app revealed new phenotypes in two inflammatory gland diseases. Finally, using combined spatial transcriptomics and proteomics, we discover under- and overrepresented immune cell types and states in regions of interest, suggesting multimodality is essential for translating spatial biology to clinical applications.
Bioinformatics
What problem does this paper attempt to address?
This paper attempts to address the challenges of identifying cell types and states in spatial biology. Although deep - learning methods are gradually being adopted in this field, these methods are difficult to generalize due to the variability of cells, neighborhoods, and niches in health and disease. To solve these problems, the research team has developed an unsupervised algorithm named TACIT (Threshold - based Assignment of Cell Types from Multiplexed Imaging Data). TACIT uses predefined feature signatures to label cells without the need for training data, distinguishes positive cells from the background through an unbiased threshold, and focuses on relevant markers to identify ambiguous cells in multi - omics assays. This algorithm outperforms existing unsupervised methods in terms of accuracy and scalability and performs excellently when dealing with five datasets of the brain, intestine, and salivary gland. In addition, TACIT is combined with a new Shiny application, which reveals new phenotypes in two inflammatory salivary gland diseases. By combining spatial transcriptomics and proteomics, it has discovered the over - expression and under - expression of immune cell types in regions of interest, highlighting the importance of multimodality for translating spatial biology into clinical applications.