Efficient and low-complexity variable-to-variable length coding for DNA storage

Yunfei Gao,Albert No
DOI: https://doi.org/10.1186/s12859-024-05943-y
IF: 3.307
2024-10-03
BMC Bioinformatics
Abstract:Efficient DNA-based storage systems offer substantial capacity and longevity at reduced costs, addressing anticipated data growth. However, encoding data into DNA sequences is limited by two key constraints: 1) a maximum of h consecutive identical bases (homopolymer constraint h ), and 2) a GC ratio between (GC content constraint ). Sequencing or synthesis errors tend to increase when these constraints are violated.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?