The Sequencing and De Novo Assembly of the Larimichthys Crocea Genome Using PacBio and Hi-C Technologies.

Baohua Chen,Zhixiong Zhou,Qiaozhen Ke,Yidi Wu,Huaqiang Bai,Fei Pu,Peng Xu
DOI: https://doi.org/10.1038/s41597-019-0194-3
2019-01-01
Scientific Data
Abstract:Larimichthys crocea is an endemic marine fish in East Asia that belongs to Sciaenidae in Perciformes . L . crocea has now been recognized as an “iconic” marine fish species in China because not only is it a popular food fish in China, it is a representative victim of overfishing and still provides high value fish products supported by the modern large-scale mariculture industry. Here, we report a chromosome-level reference genome of L . crocea generated by employing the PacBio single molecule sequencing technique (SMRT) and high-throughput chromosome conformation capture (Hi-C) technologies. The genome sequences were assembled into 1,591 contigs with a total length of 723.86 Mb and a contig N50 length of 2.83 Mb. After chromosome-level scaffolding, 24 scaffolds were constructed with a total length of 668.67 Mb (92.48% of the total length). Genome annotation identified 23,657 protein-coding genes and 7262 ncRNAs. This highly accurate, chromosome-level reference genome of L . crocea provides an essential genome resource to support the development of genome-scale selective breeding and restocking strategies of L . crocea .
What problem does this paper attempt to address?