An Improved Chromosome-scale Genome Assembly and Population Genetics resource for Populus tremula

Kathryn M Robinson,Bastian Schiffthaler,Hui Liu,Sara M Rydman,Martha Rendón-Anaya,Teitur Ahlgren Kalman,Vikash Kumar,Camilla Canovi,Carolina Bernhardsson,Nicolas Delhomme,Jerry Jenkins,Jing Wang,Niklas Mähler,Kerstin H Richau,Victoria Stokes,Stuart A'Hara,Joan Cottrell,Kizi Coeck,Tim Diels,Klaas Vandepoele,Chanaka Mannapperuma,Eung-Jun Park,Stephane Plaisance,Stefan Jansson,Pär K Ingvarsson,Nathaniel R Street
DOI: https://doi.org/10.1111/ppl.14511
Abstract:Aspen (Populus tremula L.) is a keystone species and a model system for forest tree genomics. We present an updated resource comprising a chromosome-scale assembly, population genetics and genomics data. Using the resource, we explore the genetic basis of natural variation in leaf size and shape, traits with complex genetic architecture. We generated the genome assembly using long-read sequencing, optical and high-density genetic maps. We conducted whole-genome resequencing of the Umeå Aspen (UmAsp) collection. Using the assembly and re-sequencing data from the UmAsp, Swedish Aspen (SwAsp) and Scottish Aspen (ScotAsp) collections we performed genome-wide association analyses (GWAS) using Single Nucleotide Polymorphisms (SNPs) for 26 leaf physiognomy phenotypes. We conducted Assay of Transposase Accessible Chromatin sequencing (ATAC-Seq), identified genomic regions of accessible chromatin, and subset SNPs to these regions, improving the GWAS detection rate. We identified candidate long non-coding RNAs in leaf samples, quantified their expression in an updated co-expression network, and used this to explore the functions of candidate genes identified from the GWAS. A GWAS found SNP associations for seven traits. The associated SNPs were in or near genes annotated with developmental functions, which represent candidates for further study. Of particular interest was a ~177-kbp region harbouring associations with several leaf phenotypes in ScotAsp. We have incorporated the assembly, population genetics, genomics, and GWAS data into the PlantGenIE.org web resource, including updating existing genomics data to the new genome version, to enable easy exploration and visualisation. We provide all raw and processed data to facilitate reuse in future studies.
What problem does this paper attempt to address?