Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes

David Thybert,Maša Roller,Fábio C.P. Navarro,Ian Fiddes,Ian Streeter,Christine Feig,David Martin-Galvez,Mikhail Kolmogorov,Václav Janoušek,Wasiu Akanni,Bronwen Aken,Sarah Aldridge,Varshith Chakrapani,William Chow,Laura Clarke,Carla Cummins,Anthony Doran,Matthew Dunn,Leo Goodstadt,Kerstin Howe,Matthew Howell,Ambre-Aurore Josselin,Robert C. Karn,Christina M. Laukaitis,Lilue Jingtao,Fergal Martin,Matthieu Muffato,Michael A. Quail,Cristina Sisu,Mario Stanke,Klara Stefflova,Cock Van Oosterhout,Frederic Veyrunes,Ben Ward,Fengtang Yang,Golbahar Yazdanifar,Amonida Zadissa,David Adams,Alvis Brazma,Mark Gerstein,Benedict Paten,Son Pham,Thomas Keane,Duncan T Odom,Paul Flicek
DOI: https://doi.org/10.1101/158659
2017-07-02
Abstract:ABSTRACT Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 to 6 MYA, but that are absent in the Hominidae. In fact, Hominidae show between four-and seven-fold lower rates of nucleotide change and feature turnover in both neutral and functional sequences suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. For example, recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli . This process resulted in thousands of novel, species-specific CTCF binding sites. Our results demonstrate that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.
What problem does this paper attempt to address?