High-quality De Novo Genome Assembly of Kappaphycus Alvarezii Based on Both PacBio and HiSeq Sequencing

Shangang Jia,Guoliang Wang,Guiming Liu,Jiangyong Qu,Beilun Zhao,Xinhao Jin,Lei Zhang,Jinlong Yin,Cui Liu,Guangle Shan,Shuangxiu Wu,Liguo Song,Tao Liu,Xumin Wang,Jun Yu
DOI: https://doi.org/10.1101/2020.02.15.950402
2020-01-01
Abstract:ABSTRACT The red algae Kappaphycus alvarezii is the most important aquaculture species in Kappaphycus , widely distributed in tropical waters, and it has become the main crop of carrageenan production at present. The mechanisms of adaptation for high temperature, high salinity environments and carbohydrate metabolism may provide an important inspiration for marine algae study. Scientific background knowledge such as genomic data will be also essential to improve disease resistance and production traits of K. alvarezii . 43.28 Gb short paired-end reads and 18.52 Gb single-molecule long reads of K. alvarezii were generated by Illumina HiSeq platform and Pacbio RSII platform respectively. The de novo genome assembly was performed using Falcon_unzip and Canu software, and then improved with Pilon. The final assembled genome (336 Mb) consists of 888 scaffolds with a contig N50 of 849 Kb. Further annotation analyses predicted 21,422 protein-coding genes, with 61.28% functionally annotated. Here we report the draft genome and annotations of K. alvarezii , which are valuable resources for future genomic and genetic studies in Kappaphycus and other algae.
What problem does this paper attempt to address?