DNA Information Storage Technology Based on Raptor Code

Zhang Shufang,Peng Kang
DOI: https://doi.org/10.3788/lop57.151701
2020-01-01
Abstract:Increasing the volume of data poses huge challenges to information storage. Deoxyribonucleic acid (DNA) has the natural advantages of long life, good stability, low maintenance rate, and high storage capacity. Further, it is recognized as a potential natural information storage medium. Therefore, a new DNA information storage plan is proposed, where Raptor codes arc used to convert the binary files into DNA base sequences and the structural characteristics of DNA arc considered to introduce quaternary RS (Rccd-Solomon) error correction code for ensuring the reliability of channel transmission. In addition, based on the guanine and cytosine base content, homopolymer screening schemes have been proposed to reduce the difficulty associated with DNA synthesis and sequencing as well as the error rate. Finally, files of different formats, such as text, images, and videos, arc encoded into the DNA base sequences after they arc passed through the storage framework, and biological experiments arc conducted to synthesize DNA for achieving information storage. The experimental results show that the average coding efficiency of each basic of the DNA storage framework based on Raptor codes is 1. 49 bit, and the DNA synthesized by biological experiments can recover files without errors and has a good information storage performance.
What problem does this paper attempt to address?