Unveiling the Power of High-Dimensional Cytometry Data with cy

Charlotte Kroeger,Sophie Müller,Jacqueline Leidner,Theresa Kröber,Stefanie Warnat-Herresthal,Jannis Bastian Spintge,Timo Zajac,Aleksej Frolov,Caterina Carraro,Simone Puccio,Joachim L Schultze,Tal Pecht,Marc D Beyer,Lorenzo Bonaguro
DOI: https://doi.org/10.1101/2024.02.29.582727
2024-03-03
Abstract:High-dimensional cytometry (HDC) is a powerful technology for studying single-cell phenotypes in complex biological systems. Although technological developments and affordability have made HDC broadly available in recent years, technological advances were not coupled with an adequate development of analytical methods that can take full advantage of the complex data generated. While several analytical platforms and bioinformatics tools have become available for the analysis of HDC data, these are either web-hosted with limited scalability or designed for expert computational biologists, making their use unapproachable for wet lab scientists. Additionally, end-to-end HDC data analysis is further hampered due to missing unified analytical ecosystems, requiring researchers to navigate multiple platforms and software packages to complete the analysis. To bridge this data analysis gap in HDC we developed , an computational framework covering not only all essential steps of cytometry data analysis but also including an array of downstream functions and tools to expand the biological interpretation of the data. The comprehensive suite of features of , including guided pre-processing, clustering, dimensionality reduction, and machine learning algorithms, facilitates the seamless integration of into clinically relevant settings, where scalability and disease classification are paramount for the widespread adoption of HDC in clinical practice. Additionally, the advanced analytical features of , such as pseudotime analysis and batch integration, provide researchers with the tools to extract deeper insights from their data. We used on a variety of data from different tissues and technologies demonstrating its versatility to assist the analysis of high dimensionality data from preprocessing to biological interpretation.
Bioinformatics
What problem does this paper attempt to address?