A chromosome-level, haplotype-resolved genome assembly and annotation for the Eurasian minnow (Leuciscidae: Phoxinus phoxinus) provide evidence of haplotype diversity
Temitope Opeyemi Oriowo,Ioannis Chrysostomakis,Sebastian Martin,Sandra Kukowka,Thomas Brown,Sylke Winkler,Eugene W Myers,Astrid Boehne,Madlen Stange
DOI: https://doi.org/10.1101/2023.11.30.569369
2024-09-04
Abstract:In this study we present an in-depth analysis of the Eurasian minnow (Phoxinus phoxinus) genome, highlighting its genetic diversity, structural variations, and evolutionary adaptations. We generated an annotated haplotype-phased, chromosome-level genome assembly (2n = 50) by integrating high-fidelity (HiFi) long reads and chromosome conformation capture data (Hi-C). We achieved a haploid size of 940 Megabase pairs (Mbp) for haplome one and 929 Mbp for haplome two with high scaffold N50 values of 36.4 Mb and 36.6 Mb and BUSCO scores of 96.9% and 97.2%, respectively, indicating a highly complete genome assembly. We detected notable heterozygosity (1.43%) and a high repeat content (approximately 54%), primarily consisting of DNA transposons, which contribute to genome rearrangements and variations. We found substantial structural variations within the genome, including insertions, deletions, inversions, and translocations. These variations affect genes enriched in functions such as dephosphorylation, developmental pigmentation, phagocytosis, immunity, and stress response. In the annotation of protein-coding genes, 30,980 mRNAs and 23,497 protein-coding genes were identified with a high completeness score, which further underpins the high contiguity of our genome assemblies. We performed a gene family evolution analysis by comparing our proteome to ten other teleost species, which identified immune system gene families that prioritise histone-based disease prevention over NLR-based immune responses. Additionally, demographic analysis indicates historical fluctuations in the effective population size of P. phoxinus, likely correlating with past climatic changes. This annotated, phased reference genome provides a crucial resource for resolving the taxonomic complexity within the genus Phoxinus and highlights the importance of haplotype-phased assemblies in understanding haplotype diversity in species characterised by high heterozygosity.
Genomics