Robust Partial Reference-Free Cell Composition Estimation from Tissue Expression

Ziyi Li,Zhenxing Guo,Ying Cheng,Peng Jin,Hao Wu
DOI: https://doi.org/10.1093/bioinformatics/btaa184
IF: 5.8
2020-01-01
Bioinformatics
Abstract:MOTIVATION In the analysis of high throughput omics data from tissue samples, estimating and accounting for cell composition have been recognized as important steps. High cost, intensive labor requirements and technical limitations hinder the cell composition quantification using cell sorting or single-cell technologies. Computational methods for cell composition estimation are available, but they are either limited by the availability of a reference panel or suffer from low accuracy. RESULTS We introduce TOAST/-P and TOAST/+P, two partial reference-free algorithms for estimating cell composition of heterogeneous tissues based on their gene expression profiles. TOAST/-P and TOAST/+P incorporate additional biological information, including cell type specific markers and prior knowledge of compositions, in the estimation procedure. Extensive simulation studies and real data analyses demonstrate that the proposed methods provide more accurate and robust cell composition estimation than existing methods. AVAILABILITY The proposed methods TOAST/-P and TOAST/+P are implemented as part of the R/Bioconductor package TOAST at https://bioconductor.org/packages/TOAST. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
What problem does this paper attempt to address?