Pollock: fishing for cell states

Erik P. Storrs,Daniel Cui Zhou,Michael C. Wendl,Matthew A. Wyczalkowski,Alla Karpova,Liang-Bo Wang,Yize Li,Austin Southard-Smith,Reyka G. Jayasinghe,Lijun Yao,Ruiyang Liu,Yige Wu,Nadezhda Terekhanova,Houxiang Zhu,John M. Herndon,Sid Puram,Feng Chen,William E. Gillanders,Ryan C. Fields,Li Ding
DOI: https://doi.org/10.1093/bioadv/vbac028
2022-01-01
Bioinformatics Advances
Abstract:Motivation The use of single-cell methods is expanding at an ever-increasing rate. While there are established algorithms that address cell classification, they are limited in terms of cross platform compatibility, reliance on the availability of a reference dataset and classification interpretability. Here, we introduce Pollock, a suite of algorithms for cell type identification that is compatible with popular single-cell methods and analysis platforms, provides a set of pretrained human cancer reference models, and reports interpretability scores that identify the genes that drive cell type classifications.Results Pollock performs comparably to existing classification methods, while offering easily deployable pretrained classification models across a wide variety of tissue and data types. Additionally, it demonstrates utility in immune pan-cancer analysis.Availability and implementation Source code and documentation are available at https://github.com/ding-lab/pollock. Pretrained models and datasets are available for download at https://zenodo.org/record/5895221.Supplementary information are available at Bioinformatics Advances online.
What problem does this paper attempt to address?