scKWARN: Kernel-weighted-average robust normalization for single-cell RNA-seq data

Chih-Yuan Hsu,Chia-Jung Chang,Qi Liu,Yu Shyr
DOI: https://doi.org/10.1093/bioinformatics/btae008
IF: 5.8
2024-01-17
Bioinformatics
Abstract:Abstract Motivation Single-cell RNA-seq normalization is an essential step to correct unwanted biases caused by sequencing depth, capture efficiency, dropout, and other technical factors. Existing normalization methods primarily reduce biases arising from sequencing depth by modeling count-depth relationship and/or assuming a specific distribution for read counts. However, these methods may lead to over or under-correction due to presence of technical biases beyond sequencing depth and the restrictive assumption on models and distributions. Results We present scKWARN, a Kernel Weighted Average Robust Normalization designed to correct known or hidden technical confounders without assuming specific data distributions or count-depth relationships. scKWARN generates a pseudo expression profile for each cell by borrowing information from its fuzzy technical neighbors through a kernel smoother. It then compares this profile against the reference derived from cells with the same bimodality patterns to determine the normalization factor. As demonstrated in both simulated and real datasets, scKWARN outperforms existing methods in removing a variety of technical biases while preserving true biological heterogeneity. Availability and Implementation scKWARN is freely available at https://github.com/cyhsuTN/scKWARN.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?