Genome Report: The reference genome of an endangered Asteraceae, Deinandra increscens subsp. villosa, endemic to the Central Coast of California

Susan L. McEvoy, Rachel S. Meyer, Kristen E. Hasenstab-Lehman, C. Matt Guilliams
DOI: https://doi.org/10.1101/2024.02.25.582000
2024-02-26
Abstract: We present a high-quality reference genome of the federally endangered Gaviota tarplant, Deinandra increscens subsp. villosa (Madiinae, Asteraceae), an annual herb endemic to the Central California coast. Stewards of the remaining populations plan to apply conservation strategies informed by whole-genome approaches. Using PacBio HiFi, Oxford Nanopore Technologies, and Dovetail Omni-C data, we assembled a genome of 1.67 Gbp in 28.7 K scaffolds with a scaffold N50 of 74.9 Mb. BUSCO completeness for the final assembly was 98.1%, with 15.7% duplicate copies. We annotated repeat content in 74.8% of the genome. Long terminal repeats (LTR) covered 44.0% of the genome with families predominant at 22.9% followed by at 14.2%. Both and elements were common in ancestral peaks of LTR, and the most abundant element was a element containing nested sequenced similarity, reflecting a complex evolutionary history of repeat activity. Gene annotation produced 41,039 genes and 69,563 transcripts, of which >99% were functionally annotated. BUSCO duplication rates remained very high in the annotated proteins, with 50.4% complete duplicates and 46.0% single copy. Whole genome duplication (WGD) synonymous mutation rates of Gaviota tarplant and sunflower (Helianthus annuus) shared peaks that correspond to the last Asteraceae polyploidization event and the subsequent divergence from a common ancestor at ∼27 mya. Tandem genes were twice as prevalent as WGD genes, suggesting that tandem duplication could be an important strategy of environmental adaptation in this species.
Genomics