Highly Accurate Fluorogenic DNA Sequencing with Information Theory–based Error Correction

Zitian Chen, Wenxiong Zhou, Shuo Qiao, Li Kang, Haifeng Duan, X Sunney Xie, Yanyi Huang
DOI: https://doi.org/10.1038/nbt.3982
IF: 46.9
2017-01-01
Nature Biotechnology
Abstract: Eliminating errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly improve sequencing accuracy by combining fluorogenic sequencing-by-synthesis (SBS) with an information theory-based error-correction algorithm. ECC embeds redundancy in sequencing reads by creating three orthogonal degenerate sequences, generated by alternate dual-base reactions. This is similar to encoding and decoding strategies that have proved effective in detecting and correcting errors in information communication and storage. We show that, when combined with a fluorogenic SBS chemistry with raw accuracy of 98.1%, ECC sequencing provides single-end, error-free sequences up to 200 bp. ECC approaches should enable accurate identification of extremely rare genomic variations in various applications in biology and medicine.
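The redundancy described in the abstract can be illustrated with a small sketch. It assumes that the three "orthogonal degenerate sequences" correspond to the three ways of splitting {A, C, G, T} into two pairs (written here with the IUPAC codes M/K, R/Y, W/S); the abstract does not spell out the pairings, and the function names `encode` and `decode` are hypothetical. The sketch works per base and only detects inconsistencies; it does not reproduce the signal model or the correction algorithm of the paper.

```python
# Assumption: the three dual-base groupings are the three pairings of the
# four bases, labelled with IUPAC degenerate codes.
PARTITIONS = {
    "MK": {"A": "M", "C": "M", "G": "K", "T": "K"},
    "RY": {"A": "R", "G": "R", "C": "Y", "T": "Y"},
    "WS": {"A": "W", "T": "W", "C": "S", "G": "S"},
}
ORDER = ("MK", "RY", "WS")

# Only 4 of the 8 possible label triples correspond to a real base.
TRIPLE_TO_BASE = {
    tuple(PARTITIONS[name][base] for name in ORDER): base for base in "ACGT"
}


def encode(seq):
    """Map a DNA string to its three degenerate (two-letter) sequences."""
    return {
        name: "".join(table[base] for base in seq)
        for name, table in PARTITIONS.items()
    }


def decode(reads):
    """Recover bases from three degenerate reads.

    Returns the decoded string (with 'N' at inconsistent positions) and the
    list of positions where the three labels do not agree on any base.
    """
    decoded, inconsistent = [], []
    for i, triple in enumerate(zip(*(reads[name] for name in ORDER))):
        base = TRIPLE_TO_BASE.get(triple)
        if base is None:
            inconsistent.append(i)
            decoded.append("N")
        else:
            decoded.append(base)
    return "".join(decoded), inconsistent


reads = encode("ACGT")
clean = decode(reads)

# Simulate a single error in one of the three degenerate reads.
corrupted = dict(reads, RY="RRRY")
flagged = decode(corrupted)
```

Any two of the three degenerate labels already determine the base, so the third acts as a parity check: valid triples differ from each other in two labels, which means a single wrong label always yields an invalid triple and is flagged rather than silently decoded.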