An updated reference genome sequence and annotation reveals gene losses and gains underlying naked mole-rat biology

Dustin J Sokolowski,Mihai Miclăuş,Alexander Nater,Mariela Faykoo-Martinez,Kendra Hoekzema,Philip Zuzarte,Simon Monis,Sana Akhtar Alvi,Jason Erdmann,Archana Lal Erdmann,Rathnakumar Kumaragurubaran,Jonathan Bayerl,DongAhn Yoo,Nadia Karimpour,Kyra Ungerleider,Huayun Hou,Martin Fergal,Thibaut Hourlier,Zoe A Clarke,Heidi EL Lischer,Dragos V Leordean,Yiyue Jiang,Trevor J Pugh,Ewan St. J Smith,Leanne Haggerty,Diana J Laird,Jingtao Lilue,Melissa M Holmes,Evan E Eichler,Rémy Bruggmann,Jared T Simpson,Gabriel Balmus,Michael D Wilson
DOI: https://doi.org/10.1101/2024.11.26.625329
2024-11-28
Abstract:The naked mole-rat (NMR; ) is a eusocial subterranean rodent with a highly unusual set of physiological traits that has attracted great interest amongst the scientific community. However, the genetic basis of most of these traits has not been elucidated. To facilitate our understanding of the molecular mechanisms underlying NMR physiology and behaviour, we generated a long-read chromosomal-level genome assembly of the NMR. This genome was subsequently annotated and incorporated into multiple whole genome alignments in the Ensembl database. Our long-read assembly identified thousands of repeats and genes that were previously unassembled in the NMR and improved the results of routinely used short-read sequencing-based experiments such as RNA-seq, snRNA-seq, and ATAC-seq. We identified several spermatozoa related gene losses that may underlie the unique degenerative sperm phenotype in NMRs ( , , , , , , , , and ), and an additional gene loss related to the established NK-cell absence in NMRs (PILRB). We resolved several tandem duplications in genes related to pathways underlying unique NMR adaptations including hypoxia tolerance, oxidative stress, and nervous system protection ( , , ). Lastly, we describe our ongoing efforts to generate a reference telomere-to-telomere assembly in the NMR which includes the resolution of complex gene families. This new reference genome should accelerate the discovery of the genetic underpinnings of NMR physiology and adaptation.
Genomics
What problem does this paper attempt to address?