A Re-Sequencing Tool For High Mismatch-Tolerant Short Read Alignment Based On Burrows-Wheeler Transform

c h lu,c y lin,chuan yi tang
DOI: https://doi.org/10.1109/BIBMW.2010.5703860
2010-01-01
Abstract:After the reference genomes of many organisms are sequenced in this post-genetic era, it has become an extremely important issue that how to do the resequencing and assembly for individual genomes from very large amount of reads. In this paper, we will present a resequencing tool designed for the Next Generation Sequencing (NGS) data. And these data are composed of a huge amount of short reads which will be aligned onto a reference genome. We modified and implemented the algorithm of Burrows-Wheeler Transform and FM-index to build the genome index of human, and proposed an idea to segment each short read into multiple non-overlapping seeds, which let us align short reads with large Hamming distance. Finally, we used the simulated datasets and real datasets from 1000 Genome Project to demonstrate the performance of our tool on a personal computer, and compared the results with widely used tools, bowtie and SOAPv2.
What problem does this paper attempt to address?