Chromosomal-Level Genome Assembly Of The Sea Urchin Lytechinus Variegatus Substantially Improves Functional Genomic Analyses

Phillip L. Davidson,Haobing Guo,Lingyu Wang,Alejandro Berrio,He Zhang,Yue Chang,Andrew L. Soborowski,David R. Mcclay,Guangyi Fan,Gregory A. Wray
DOI: https://doi.org/10.1093/gbe/evaa101
2020-01-01
Genome Biology and Evolution
Abstract:Lytechinus variegatus is a camarodont sea urchin found widely throughout the western Atlantic Ocean in a variety of shallow-water marine habitats. Its distribution, abundance, and amenability to developmental perturbation make it a popular model for ecologists and developmental biologists. Here, we present a chromosomal-level genome assembly of L. variegatus generated from a combination of PacBio long reads, 10x Genomics sequencing, and HiC chromatin interaction sequencing. We show L. variegatus has 19 chromosomes with an assembly size of 870.4Mb. The contiguity and completeness of this assembly are reflected by a scaffold length N50 of 45.5Mb and BUSCO completeness score of 95.5%. Ab initio and transcript-informed gene modeling and annotation identified 27,232 genes with an average gene length of 12.6 kb, comprising an estimated 39.5% of the genome. Repetitive regions, on the other hand, make up 45.4% of the genome. Physical mapping of well-studied developmental genes onto each chromosome reveals nonrandom spatial distribution of distinct genes and gene families, which provides insight into how certain gene families may have evolved and are transcriptionally regulated in this species. Lastly, aligning RNA-seq and ATAC-seq data onto this assembly demonstrates the value of highly contiguous, complete genome assemblies for functional genomics analyses that is unattainable with fragmented, incomplete assemblies. This genome will be of great value to the scientific community as a resource for genome evolution, developmental, and ecological studies of this species and the Echinodermata.
What problem does this paper attempt to address?