Quality-Score Guided Error Correction for Short-Read Sequencing Data Using Cuda

Haixiang Shi,Bertil Schmidt,Weiguo Liu,Wolfgang Mueller-Wittig
DOI: https://doi.org/10.1016/j.procs.2010.04.125
2010-01-01
Procedia Computer Science
Abstract:Recently introduced new sequencing technologies can produce massive amounts of short-read data. Detection and correction of sequencing errors in this data is an important but time-consuming pre-processing step for de-novo genome assembly. In this paper, we demonstrate how the quality-score value associated with each base-call can be integrated in a CUDA-based parallel error correction algorithm. We show that quality-score guided error correction can improve the assembly accuracy of several datasets from the NCBI SRA (Short-Read Archive) in terms of N50-values as well as runtime. We further propose a number of improvements of to our previously published CUDA-EC algorithm to improve its runtime by a factor of up to 1.88.
What problem does this paper attempt to address?