Multiple Reference Genomes and Transcriptomes for Arabidopsis Thaliana

Xiangchao Gan,Oliver Stegle,Jonas Behr,Joshua G. Steffen,Philipp Drewe,Katie L. Hildebrand,Rune Lyngsoe,Sebastian J. Schultheiss,Edward J. Osborne,Vipin T. Sreedharan,André Kahles,Regina Bohnert,Géraldine Jean,Paul Derwent,Paul Kersey,Eric J. Belfield,Nicholas P. Harberd,Eric Kemen,Christopher Toomajian,Paula X. Kover,Richard M. Clark,Gunnar Rätsch,Richard Mott
DOI: https://doi.org/10.1038/nature10414
IF: 64.8
2011-01-01
Nature
Abstract:Genetic differences between Arabidopsis thaliana accessions underlie the plant's extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions.
What problem does this paper attempt to address?