Barnacle: An Assembly Algorithm for Clone-based Sequences of Whole Genomes

Vicky Choi,Martin Farach-Colton
DOI: https://doi.org/10.48550/arXiv.cs/0302005
2003-02-04
Abstract:We propose an assembly algorithm {\sc Barnacle} for sequences generated by the clone-based approach. We illustrate our approach by assembling the human genome. Our novel method abandons the original physical-mapping-first framework. As we show, {\sc Barnacle} more effectively resolves conflicts due to repeated sequences. The latter is the main difficulty of the sequence assembly problem. Inaddition, we are able to detect inconsistencies in the underlying data. We present and compare our results on the December 2001 freeze of the public working draft of the human genome with NCBI's assembly (Build 28). The assembly of December 2001 freeze of the public working draft generated by {\sc Barnacle} and the source code of {\sc Barnacle} are available at (<a class="link-external link-http" href="http://www.cs.rutgers.edu/~vchoi" rel="external noopener nofollow">this http URL</a>).
Data Structures and Algorithms,Discrete Mathematics,Quantitative Biology
What problem does this paper attempt to address?