An Atlas of Cellular Archetypes

George Crowley,Uri Alon,Stephen R. Quake
DOI: https://doi.org/10.1101/2024.12.04.626890
2024-12-04
Abstract:Single cell RNA sequencing allows effective characterization of cell types and cell states. Cell types can be reliably identified as defined clusters via marker gene expression and differential expression analysis. However, once a cell type is identified, current analysis methods provide little information about the potential variety of functions within the cell type. Here, we apply a principled computational approach to understand the functional heterogeneity within a given cell type. It has been hypothesized that cells in competitive environments will conform to Pareto task optimality theory by organizing in low-dimensional polytopes in gene expression space, and that specialist phenotypes identified via Pareto Task Inference Analysis would be reproducible and general. We used the Tabula Sapiens atlas of single-cell RNA sequencing across cell types and tissues in the human body to test this hypothesis and discovered that most cell types are well-fit by this theory. Furthermore, we derived specialist phenotypes from this method, which were used to identify archetypes of cellular function that represent essential biological tasks. We compared the specialist phenotypes from common cell types shared across tissues and found consistent results. These phenotypes are derived from an unbiased approach and do not incorporate ideas from existing biological models or theories, and yet in many cases they recapitulate our understanding of the functions of major cell types. Taken together, these results suggest a principled approach to annotating the continuum of functions within each cell type.
Molecular Biology
What problem does this paper attempt to address?