The Norway spruce genome sequence and conifer genome evolution
Björn Nystedt,Nathaniel R. Street,Anna Wetterbom,Andrea Zuccolo,Yao-Cheng Lin,Douglas G. Scofield,Francesco Vezzi,Nicolas Delhomme,Stefania Giacomello,Andrey Alexeyenko,Riccardo Vicedomini,Kristoffer Sahlin,Ellen Sherwood,Malin Elfstrand,Lydia Gramzow,Kristina Holmberg,Jimmie Hällman,Olivier Keech,Lisa Klasson,Maxim Koriabine,Melis Kucukoglu,Max Käller,Johannes Luthman,Fredrik Lysholm,Totte Niittylä,Åke Olson,Nemanja Rilakovic,Carol Ritland,Josep A. Rosselló,Juliana Sena,Thomas Svensson,Carlos Talavera-López,Günter Theißen,Hannele Tuominen,Kevin Vanneste,Zhi-Qiang Wu,Bo Zhang,Philipp Zerbe,Lars Arvestad,Rishikesh Bhalerao,Joerg Bohlmann,Jean Bousquet,Rosario Garcia Gil,Torgeir R. Hvidsten,Pieter de Jong,John MacKay,Michele Morgante,Kermit Ritland,Björn Sundberg,Stacey Lee Thompson,Yves Van de Peer,Björn Andersson,Ove Nilsson,Pär K. Ingvarsson,Joakim Lundeberg,Stefan Jansson
DOI: https://doi.org/10.1038/nature12211
IF: 64.8
2013-05-01
Nature
Abstract:Conifers have dominated forests for more than 200 million years and are of huge ecological and economic importance. Here we present the draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any gymnosperm. The number of well-supported genes (28,354) is similar to the >100 times smaller genome of Arabidopsis thaliana, and there is no evidence of a recent whole-genome duplication in the gymnosperm lineage. Instead, the large genome size seems to result from the slow and steady accumulation of a diverse set of long-terminal repeat transposable elements, possibly owing to the lack of an efficient elimination mechanism. Comparative sequencing of Pinus sylvestris, Abies sibirica, Juniperus communis, Taxus baccata and Gnetum gnemon reveals that the transposable element diversity is shared among extant conifers. Expression of 24-nucleotide small RNAs, previously implicated in transposable element silencing, is tissue-specific and much lower than in other plants. We further identify numerous long (>10,000 base pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs. This opens up new genomic avenues for conifer forestry and breeding.
multidisciplinary sciences