Enhancing RNA-seq bias mitigation with the Gaussian self-benchmarking framework: towards unbiased sequencing data

Qiang Su,Yi Long,Deming Gou,Junmin Quan,Qizhou Lian
DOI: https://doi.org/10.1186/s12864-024-10814-0
IF: 4.547
2024-10-01
BMC Genomics
Abstract:RNA sequencing is a vital technique for analyzing RNA behavior in cells, but it often suffers from various biases that distort the data. Traditional methods to address these biases are typically empirical and handle them individually, limiting their effectiveness. Our study introduces the Gaussian Self-Benchmarking (GSB) framework, a novel approach that leverages the natural distribution patterns of guanine (G) and cytosine (C) content in RNA to mitigate multiple biases simultaneously. This method is grounded in a theoretical model, organizing k-mers based on their GC content and applying a Gaussian model for alignment to ensure empirical sequencing data closely match their theoretical distribution.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?