A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range

Qichao Lian, Bruno Huettel, Birgit Walkemeier, Baptiste Mayjonade, Céline Lopez-Roques, Lisa Gil, Fabrice Roux, Korbinian Schneeberger, Raphael Mercier
DOI: https://doi.org/10.1038/s41588-024-01715-9
IF: 30.8
2024-04-12
Nature Genetics
Abstract: Although originally primarily a system for functional biology, Arabidopsis thaliana has, owing to its broad geographical distribution and adaptation to diverse environments, developed into a powerful model in population genomics. Here we present chromosome-level genome assemblies of 69 accessions from a global species range. We found that genomic colinearity is very conserved, even among geographically and genetically distant accessions. Along chromosome arms, megabase-scale rearrangements are rare and typically present only in a single accession. This indicates that the karyotype is quasi-fixed and that rearrangements in chromosome arms are counter-selected. Centromeric regions display higher structural dynamics, and divergences in core centromeres account for most of the genome size variations. Pan-genome analyses uncovered 32,986 distinct gene families, 60% being present in all accessions and 40% appearing to be dispensable, including 18% private to a single accession, indicating unexplored genic diversity. These 69 new Arabidopsis thaliana genome assemblies will empower future genetic research.
genetics & heredity