An Empirical Algorithm for Bias Correction Based on GC Estimation for Single Cell Sequencing.

Bo Xu,Tengpeng Li,Yi Luo,Ruotao Xu,Hongmin Cai
DOI: https://doi.org/10.1007/978-3-319-13186-3_2
2014-01-01
Abstract:Whole genome amplification (WGA) have been applied to single cell copy number variations (CNVs) analysis, which is a common genomic mutation associated with various diseases and provides new insight for the fields of biology and medicine. However, the WGA-induced bias based on multiple displacement amplification (MDA) significantly limits sensitivity and specificity for CNVs detection. To address the limitations, an empirical algorithm for CNVs detection at single cell level was developed. This proposed method consists of base call amplification, alig- nment and analysis to remove the MDA-induced bias. We generated and analyzed about 50G short read data sets based on MDAsim, a software to amplify the chromosome 21 into various coverage. Simulation experiments have shown that the coverage tended to be less than average in genomic GC-enriched (>45 %) regions, implying a significant amplification bias within these regions. Base substitution error frequencies with G > A transversion is being among the most frequent and C > T, G > T transversions are among the least frequent substitution errors. The estimated substitution was employed to compensate errors to correct bias readings.
What problem does this paper attempt to address?