Time-space Efficient Short-read Alignment with Inserting Gaps

Yong-jie YANG,Cheng ZHONG
DOI: https://doi.org/10.3969/j.issn.1000-1220.2019.05.018
2019-01-01
Abstract:Short-read alignment is applied widely in next-generation sequencing technology. Accurate identification of gaps is the basis for subsequent genome interpretation. The existing algorithms allowing insertion of gaps varies significantly and many performs poorly or not allows insertion of gaps at all. For sequence alignment problem that both query and reference sequences are short reads,this arti-cle,performs pairwise sequence alignment for millions of that reads by training query sequence sample data in order to find different species and the optimal number of inserting gaps matched with reads of different length. The improved short-read alignment algorithm can reduce the number of iterations of the algorithm to reduce the computation of the intermediate matrix and use vector to store inter-mediate matrix elements in the process to reduce storage space. The results for large-scale of short reads show that compared to the exist-ing algorithms,the presented algorithm can improve the alignment accuracy and reduce the execution time and required memory space.
What problem does this paper attempt to address?